About this item:

64 Views | 47 Downloads

Author Notes:

Fred Prior

Department of Biomedical Informatics, University of Arkansas for Medical Sciences, 4301 W. Markham St, Little Rock, Arkansas, 72205, USA.

Tel.: +1 501 686 5966

fwprior@uams.edu

Author Contributions: 1. guarantor of integrity of the entire study - Fred Prior; 2. study concepts and design – Fred Prior, Kirk Smith; 3. literature research - All Authors; 4. clinical studies - N/A; 5. experimental studies / data analysis - N/A; 6. statistical analysis - N/A; 7. manuscript preparation - All Authors; 8. manuscript editing - Fred Prior.

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Subjects:

Research Funding:

This work was supported in part by the National Cancer Institute, National Institutes of Health contract no. HHSN261200800001E, subcontract 16X011; National Cancer Institute 1U01CA187013 and 1U24CA215109.

Keywords:

  • Science & Technology
  • Life Sciences & Biomedicine
  • Radiology, Nuclear Medicine & Medical Imaging
  • COMPUTER-AIDED DIAGNOSIS
  • MEDICAL IMAGES
  • LUNG NODULES
  • RADIOMICS
  • MANAGEMENT
  • CAD
  • IDENTIFICATION
  • INFORMATION
  • VALIDATION
  • ALGORITHM

Open access image repositories: high-quality data to enable machine learning research

Tools:

Journal Title:

CLINICAL RADIOLOGY

Volume:

Volume 75, Number 1

Publisher:

, Pages 7-12

Type of Work:

Article | Post-print: After Peer Review

Abstract:

Originally motivated by the need for research reproducibility and data reuse, large-scale, open access information repositories have become key resources for training and testing of advanced machine learning applications in biomedical and clinical research. To be of value, such repositories must provide large, high-quality data sets, where quality is defined as minimising variance due to data collection protocols and data misrepresentations. Curation is the key to quality. We have constructed a large public access image repository, The Cancer Imaging Archive, dedicated to the promotion of open science to advance the global effort to diagnose and treat cancer. Drawing on this experience and our experience in applying machine learning techniques to the analysis of radiology and pathology image data, we will review the requirements placed on such information repositories by state-of-the-art machine learning applications and how these requirements can be met.

Copyright information:

2019

This is an Open Access work distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/rdf).
Export to EndNote