IDEAS home Printed from https://ideas.repec.org/a/plo/pmed00/1003381.html

Assessment of deep neural networks for the diagnosis of benign and malignant skin neoplasms in comparison with dermatologists: A retrospective validation study

Author

Listed:
  • Seung Seog Han
  • Ik Jun Moon
  • Seong Hwan Kim
  • Jung-Im Na
  • Myoung Shin Kim
  • Gyeong Hun Park
  • Ilwoo Park
  • Keewon Kim
  • Woohyung Lim
  • Ju Hee Lee
  • Sung Eun Chang

Abstract

Background: The diagnostic performance of convolutional neural networks (CNNs) for diagnosing several types of skin neoplasms has been demonstrated as comparable with that of dermatologists using clinical photography. However, the generalizability should be demonstrated using a large-scale external dataset that includes most types of skin neoplasms. In this study, the performance of a neural network algorithm was compared with that of dermatologists in both real-world practice and experimental settings. Methods and findings: To demonstrate generalizability, the skin cancer detection algorithm (https://rcnn.modelderm.com) developed in our previous study was used without modification. We conducted a retrospective study with all single lesion biopsied cases (43 disorders; 40,331 clinical images from 10,426 cases: 1,222 malignant cases and 9,204 benign cases); mean age (standard deviation [SD], 52.1 [18.3]; 4,701 men [45.1%]) were obtained from the Department of Dermatology, Severance Hospital in Seoul, Korea between January 1, 2008 and March 31, 2019. Using the external validation dataset, the predictions of the algorithm were compared with the clinical diagnoses of 65 attending physicians who had recorded the clinical diagnoses with thorough examinations in real-world practice. Conclusions: Our algorithm could diagnose skin tumors with nearly the same accuracy as a dermatologist when the diagnosis was performed solely with photographs. However, as a result of limited data relevancy, the performance was inferior to that of actual medical examination. To achieve more accurate predictive diagnoses, clinical information should be integrated with imaging information. Seung Seog Han and colleagues compare the performance of a neural network algorithm with that of dermatologists in diagnosis of skin neoplasms.Why was this study done?: What did the researchers do and find?: What do these findings mean?:

Suggested Citation

  • Seung Seog Han & Ik Jun Moon & Seong Hwan Kim & Jung-Im Na & Myoung Shin Kim & Gyeong Hun Park & Ilwoo Park & Keewon Kim & Woohyung Lim & Ju Hee Lee & Sung Eun Chang, 2020. "Assessment of deep neural networks for the diagnosis of benign and malignant skin neoplasms in comparison with dermatologists: A retrospective validation study," PLOS Medicine, Public Library of Science, vol. 17(11), pages 1-21, November.
  • Handle: RePEc:plo:pmed00:1003381
    DOI: 10.1371/journal.pmed.1003381
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosmedicine/article?id=10.1371/journal.pmed.1003381
    Download Restriction: no

    File URL: https://journals.plos.org/plosmedicine/article/file?id=10.1371/journal.pmed.1003381&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pmed.1003381?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Andre Esteva & Brett Kuprel & Roberto A. Novoa & Justin Ko & Susan M. Swetter & Helen M. Blau & Sebastian Thrun, 2017. "Dermatologist-level classification of skin cancer with deep neural networks," Nature, Nature, vol. 542(7639), pages 115-118, February.
    2. Pranav Rajpurkar & Jeremy Irvin & Robyn L Ball & Kaylie Zhu & Brandon Yang & Hershel Mehta & Tony Duan & Daisy Ding & Aarti Bagul & Curtis P Langlotz & Bhavik N Patel & Kristen W Yeom & Katie Shpanska, 2018. "Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists," PLOS Medicine, Public Library of Science, vol. 15(11), pages 1-17, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Young Jae Kim & Jung-Im Na & Seung Seog Han & Chong Hyun Won & Mi Woo Lee & Jung-Won Shin & Chang-Hun Huh & Sung Eun Chang, 2022. "Augmenting the accuracy of trainee doctors in diagnosing skin lesions suspected of skin neoplasms in a real-world setting: A prospective controlled before-and-after study," PLOS ONE, Public Library of Science, vol. 17(1), pages 1-11, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Oded Rotem & Tamar Schwartz & Ron Maor & Yishay Tauber & Maya Tsarfati Shapiro & Marcos Meseguer & Daniella Gilboa & Daniel S. Seidman & Assaf Zaritsky, 2024. "Visual interpretability of image-based classification models by generative latent space disentanglement applied to in vitro fertilization," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    2. Majd Oteibi & Adam Tamimi & Kaneez Abbas & Gabriel Tamimi & Danesh Khazaei & Hadi Khazaei, 2024. "Advancing Digital Health using AI and Machine Learning Solutions for Early Ultrasonic Detection of Breast Disorders in Women," International Journal of Research and Scientific Innovation, International Journal of Research and Scientific Innovation (IJRSI), vol. 11(11), pages 518-527, November.
    3. Syed Ibrar Hussain & Elena Toscano, 2025. "Enhancing Recognition and Categorization of Skin Lesions with Tailored Deep Convolutional Networks and Robust Data Augmentation Techniques," Mathematics, MDPI, vol. 13(9), pages 1-36, April.
    4. von Walter, Benjamin & Wentzel, Daniel & Raff, Stefan, 2023. "Should service firms introduce algorithmic advice to their existing customers? The moderating effect of service relationships," Journal of Retailing, Elsevier, vol. 99(2), pages 280-296.
    5. Sidra Mehboob, Maryam Bukhari, Yaser Ali Shah, SalabatKhan, MuhammadSharif, 2025. "Enhanced Skin Cancer Classification with MobileNetV3 and Morphological Preprocessing: A Deep Learning-Based Extension," International Journal of Innovations in Science & Technology, 50sea, vol. 7(7), pages 1-12, May.
    6. Han Li & Feng Tian, 2026. "Advancing Decision-Making through AI-Human Collaboration: A Systematic Review and Conceptual Framework," Group Decision and Negotiation, Springer, vol. 35(2), pages 1-24, June.
    7. Riccardo Zanardelli, 2025. "Navigating the safe harbor paradox in human-machine systems," Papers 2509.14057, arXiv.org, revised Jan 2026.
    8. repec:bjc:journl:v:12:y:2025:i:9:p:2881-2888 is not listed on IDEAS
    9. Lin Lu & Laurent Dercle & Binsheng Zhao & Lawrence H. Schwartz, 2021. "Deep learning for the prediction of early on-treatment response in metastatic colorectal cancer from serial medical imaging," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    10. Sangwon Chae & Sungjun Kwon & Donghyun Lee, 2018. "Predicting Infectious Disease Using Deep Learning and Big Data," IJERPH, MDPI, vol. 15(8), pages 1-20, July.
    11. Kita-Wojciechowska Kinga & Kidziński Łukasz, 2019. "Google Street View image predicts car accident risk," Central European Economic Journal, Sciendo, vol. 6(53), pages 151-163, January.
    12. Zheng Yan & Wenqian Robertson & Yaosheng Lou & Tom W. Robertson & Sung Yong Park, 2021. "Finding leading scholars in mobile phone behavior: a mixed-method analysis of an emerging interdisciplinary field," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9499-9517, December.
    13. Songhee Cheon & Jungyoon Kim & Jihye Lim, 2019. "The Use of Deep Learning to Predict Stroke Patient Mortality," IJERPH, MDPI, vol. 16(11), pages 1-12, May.
    14. Marcus Buckmann & Andy Haldane & Anne-Caroline Hüser, 2021. "Comparing minds and machines: implications for financial stability," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 37(3), pages 479-508.
    15. Freddy Gabbay & Rotem Lev Aharoni & Ori Schweitzer, 2022. "Deep Neural Network Memory Performance and Throughput Modeling and Simulation Framework," Mathematics, MDPI, vol. 10(21), pages 1-20, November.
    16. Matteo D’Antonio & Wilfredo G. Gonzalez Rivera & Robert A. Greenes & Melissa Gymrek & Kelly A. Frazer, 2025. "A highly accurate risk factor-based XGBoost multiethnic model for identifying patients with skin cancer," Nature Communications, Nature, vol. 16(1), pages 1-15, December.
    17. Sebastian Gehrmann & Franck Dernoncourt & Yeran Li & Eric T Carlson & Joy T Wu & Jonathan Welt & John Foote Jr. & Edward T Moseley & David W Grant & Patrick D Tyler & Leo A Celi, 2018. "Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives," PLOS ONE, Public Library of Science, vol. 13(2), pages 1-19, February.
    18. Severin Rodler & Gerald Schulz & Alexander Buchner & Christian Stief & Michael Staehler & Jozefina Casuscelli, 2019. "The Role of Digital Biomarkers in Cancer Research and Patient Care," Biomedical Journal of Scientific & Technical Research, Biomedical Research Network+, LLC, vol. 17(3), pages 12870-12872, April.
    19. Ting Wang & Boyang Zang & Chui Kong & Yigang Li & Xiaomin Yang & Yi Yu, 2025. "Intelligent and precise auxiliary diagnosis of breast tumors using deep learning and radiomics," PLOS ONE, Public Library of Science, vol. 20(6), pages 1-11, June.
    20. Kai Feng & Han Hong & Ke Tang & Jingyuan Wang, 2025. "Statistical Tests for Replacing Human Decision Makers with Algorithms," Management Science, INFORMS, vol. 71(11), pages 9145-9170, November.
    21. Sonika Darshan, 2024. "Data Mining for Disease Diagnosis: A Review of Machine Learning Approaches in Healthcare," Journal of Artificial Intelligence General science (JAIGS) ISSN:3006-4023, Open Knowledge, vol. 6(1), pages 716-726.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pmed00:1003381. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosmedicine (email available below). General contact details of provider: https://journals.plos.org/plosmedicine/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.