Synapse - Expert-level diagnosis of nonpigmented skin cancer by combined convolutional neural networks

Expert-level diagnosis of nonpigmented skin cancer by combined convolutional neural networks Journal Article

Authors:	Tschandl, P.; Rosendahl, C.; Akay, B. N.; Argenziano, G.; Blum, A.; Braun, R. P.; Cabo, H.; Gourhant, J. Y.; Kreusch, J.; Lallas, A.; Lapins, J.; Marghoob, A.; Menzies, S.; Neuber, N. M.; Paoli, J.; Rabinovitz, H. S.; Rinner, C.; Scope, A.; Soyer, H. P.; Sinz, C.; Thomas, L.; Zalaudek, I.; Kittler, H.
Article Title:	Expert-level diagnosis of nonpigmented skin cancer by combined convolutional neural networks
Abstract:	Importance: Convolutional neural networks (CNNs) achieve expert-level accuracy in the diagnosis of pigmented melanocytic lesions. However, the most common types of skin cancer are nonpigmented and nonmelanocytic, and are more difficult to diagnose. Objective: To compare the accuracy of a CNN-based classifier with that of physicians with different levels of experience. Design, Setting, and Participants: A CNN-based classification model was trained on 7895 dermoscopic and 5829 close-up images of lesions excised at a primary skin cancer clinic between January 1, 2008, and July 13, 2017, for a combined evaluation of both imaging methods. The combined CNN (cCNN) was tested on a set of 2072 unknown cases and compared with results from 95 human raters who were medical personnel, including 62 board-certified dermatologists, with different experience in dermoscopy. Main Outcomes and Measures: The proportions of correct specific diagnoses and the accuracy to differentiate between benign and malignant lesions measured as an area under the receiver operating characteristic curve served as main outcome measures. Results: Among 95 human raters (51.6% female; mean age, 43.4 years; 95% CI, 41.0-45.7 years), the participants were divided into 3 groups (according to years of experience with dermoscopy): beginner raters (<3 years), intermediate raters (3-10 years), or expert raters (>10 years). The area under the receiver operating characteristic curve of the trained cCNN was higher than human ratings (0.742; 95% CI, 0.729-0.755 vs 0.695; 95% CI, 0.676-0.713; P <.001). The specificity was fixed at the mean level of human raters (51.3%), and therefore the sensitivity of the cCNN (80.5%; 95% CI, 79.0%-82.1%) was higher than that of human raters (77.6%; 95% CI, 74.7%-80.5%). The cCNN achieved a higher percentage of correct specific diagnoses compared with human raters (37.6%; 95% CI, 36.6%-38.4% vs 33.5%; 95% CI, 31.5%-35.6%; P =.001) but not compared with experts (37.3%; 95% CI, 35.7%-38.8% vs 40.0%; 95% CI, 37.0%-43.0%; P =.18). Conclusions and Relevance: Neural networks are able to classify dermoscopic and close-up images of nonpigmented lesions as accurately as human experts in an experimental setting. © 2018 American Medical Association. All rights reserved.
Keywords:	adult; controlled study; major clinical study; area under the curve; validation process; diagnostic accuracy; sensitivity and specificity; skin cancer; epiluminescence microscopy; receiver operating characteristic; medical personnel; artificial neural network; dermatologist; human; male; female; priority journal; article; convolutional neural network
Journal Title:	JAMA Dermatology
Volume:	155
Issue:	1
ISSN:	2168-6068
Publisher:	American Medical Association
Date Published:	2019-01-01
Start Page:	58
End Page:	65
Language:	English
DOI:	10.1001/jamadermatol.2018.4378
PUBMED:	30484822
PROVIDER:	scopus
PMCID:	PMC6439580
DOI/URL:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85057833474&doi=10.1001%2fjamadermatol.2018.4378&partnerID=40&md5=6f45ddcefc448b216876019ff4b24e3d
Notes:	Article -- Export Date: 1 March 2019 -- Source: Scopus

Altmetric

What is Altmetric?

Citation Impact

What is Dimensions Citation Badge?

BMJ Impact Analytics

MSK Authors

534 Marghoob

Related MSK Work

Artificial Intelligence In Skin Cancer

Current Dermatology Reports 2019
Accuracy Of Dermatoscopy For The Diagnosis Of Nonpigmented Cancers Of The Skin

Journal of the American Academy of Dermatology 2017
Man Against Machine: Diagnostic Performance Of A Deep Learning Convolutional Neural Network For Dermoscopic Melanoma Recognition In Comparison To 58 Dermatologists

Annals of Oncology 2018
Association Of Shiny White Blotches And Strands With Nonpigmented Basal Cell Carcinoma Evaluation Of An Additional Dermoscopic Diagnostic Criterion

JAMA Dermatology 2016
Dermoscopy Training Effect On Diagnostic Accuracy Of Skin Lesions In Canadian Family Medicine Physicians Using The Triage Amalgamated Dermoscopic Algorithm

Dermatology Practical & Conceptual 2020