Publication

Results of the 2016 International Skin Imaging Collaboration International Symposium on Biomedical Imaging challenge: Comparison of the accuracy of computer algorithms to dermatologists for the diagnosis of melanoma from dermoscopic images

Downloadable Content

Persistent URL
Last modified
  • 05/15/2025
Type of Material
Authors
    Michael A. Marchetti, Memorial Sloan Kettering Cancer CenterNoel C.F. Codella, IBM Research DivisionStephen W. Dusza, Memorial Sloan Kettering Cancer CenterDavid Gutman, Emory UniversityBrian Helba, Kitware IncAadi Kalloo, Memorial Sloan Kettering Cancer CenterNabin Mishra, Stoecker & AssociatesCristina Carrera, University of BarcelonaM. Emre Celebi, University of Central ArkansasJennifer L. DeFazio, Memorial Sloan Kettering Cancer CenterNatalia Jaimes, Aurora Centro Especializado en Cancer del PielAshfaq A. Marghoob, Memorial Sloan Kettering Cancer CenterElizabeth Quigley, Memorial Sloan Kettering Cancer CenterAlon Scope, Memorial Sloan Kettering Cancer CenterOriol Yelamos, Memorial Sloan Kettering Cancer CenterAllan C. Halpern, Memorial Sloan Kettering Cancer Center
Language
  • English
Date
  • 2018-02-01
Publisher
  • Elsevier: 12 months
Publication Version
Copyright Statement
  • © 2017 American Academy of Dermatology, Inc.
License
Final Published Version (URL)
Title of Journal or Parent Work
ISSN
  • 0190-9622
Volume
  • 78
Issue
  • 2
Start Page
  • 270
End Page
  • +
Grant/Funding Information
  • This research was funded in part through the NIH/NCI Cancer Center Support Grant P30 CA008748.
Supplemental Material (URL)
Abstract
  • Background: Computer vision may aid in melanoma detection. Objective: We sought to compare melanoma diagnostic accuracy of computer algorithms to dermatologists using dermoscopic images. Methods: We conducted a cross-sectional study using 100 randomly selected dermoscopic images (50 melanomas, 44 nevi, and 6 lentigines) from an international computer vision melanoma challenge dataset (n = 379), along with individual algorithm results from 25 teams. We used 5 methods (nonlearned and machine learning) to combine individual automated predictions into “fusion” algorithms. In a companion study, 8 dermatologists classified the lesions in the 100 images as either benign or malignant. Results: The average sensitivity and specificity of dermatologists in classification was 82% and 59%. At 82% sensitivity, dermatologist specificity was similar to the top challenge algorithm (59% vs. 62%, P =.68) but lower than the best-performing fusion algorithm (59% vs. 76%, P =.02). Receiver operating characteristic area of the top fusion algorithm was greater than the mean receiver operating characteristic area of dermatologists (0.86 vs. 0.71, P =.001). Limitations: The dataset lacked the full spectrum of skin lesions encountered in clinical practice, particularly banal lesions. Readers and algorithms were not provided clinical data (eg, age or lesion history/symptoms). Results obtained using our study design cannot be extrapolated to clinical practice. Conclusion: Deep learning computer vision systems classified melanoma dermoscopy images with accuracy that exceeded some but not all dermatologists.
Author Notes
  • Allan C. Halpern, MD, Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, 16 East 60th Street, New York, NY 10022, U.S.A. Telephone: 646-888-6012. Fax: 646-227-7274. halperna@mskcc.org
Keywords
Research Categories
  • Computer Science
  • Health Sciences, Medicine and Surgery

Tools

Relations

In Collection:

Items