Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jun 22, 2020
Date Accepted: Feb 8, 2021

The final, peer-reviewed published version of this preprint can be found here:

Reducing the Impact of Confounding Factors on Skin Cancer Classification via Image Segmentation: Technical Model Study

Maron RC, Hekler A, Krieghoff-Henning E, Schmitt M, Schlager JG, Utikal JS, Brinker TJ

Reducing the Impact of Confounding Factors on Skin Cancer Classification via Image Segmentation: Technical Model Study

J Med Internet Res 2021;23(3):e21695

DOI: 10.2196/21695

PMID: 33764307

PMCID: 8074854

Reducing Confounding Factors’ Impact on Skin Cancer Classification via Image Segmentation: Technical Model Study

  • Roman Christoph Maron; 
  • Achim Hekler; 
  • Eva Krieghoff-Henning; 
  • Max Schmitt; 
  • Justin Gabriel Schlager; 
  • Jochen Sven Utikal; 
  • Titus Josef Brinker

ABSTRACT

Background:

Studies have shown that artificial intelligence achieves similar or better performance than dermatologists in specific dermoscopic image classification tasks. It is however susceptible to the influence of confounding factors within images (e.g. skin markings), which can lead to false diagnoses of cancerous skin lesions. Image segmentation can remove lesion-adjacent confounding factors but greatly changes the image’s representation.

Objective:

Compare the performance of binary skin lesion classifiers trained on segmented or unsegmented dermoscopic images and determine if and how the segmentation process affects their skin lesion classification performance.

Methods:

Separate binary skin lesion classifiers (nevus vs. melanoma) were trained and evaluated on segmented and unsegmented dermoscopic images. For a more informative result, classifiers were trained on two distinct training datasets (HAM and ISIC). Each training run was repeated five times, resulting in an ensemble of classifiers for each training dataset and preprocessing method. Mean ensemble performance was evaluated on a multi-source test set (n=688) consisting of a holdout and an external component.

Results:

Segmented HAM classifiers showed a higher mean balanced accuracy of 75.6% ±1.1% compared to 66.7% ±3.2% (unsegmented), which was significant in four out of five runs (P<.001). ISIC classifiers showed a comparable mean balanced accuracy of 77.4% ±1.5% (segmented) and 78.3% ±1.8% (unsegmented), which was significantly different in one out of five runs (P=.004).

Conclusions:

Image segmentation does not result in an overall performance decrease and additionally causes the beneficial removal of lesion-adjacent confounding factors. Thus it is a viable option to address the negative impact confounding factors have on DL-models in dermatology. However, the segmentation step might introduce new pitfalls, which requires future investigation.


 Citation

Please cite as:

Maron RC, Hekler A, Krieghoff-Henning E, Schmitt M, Schlager JG, Utikal JS, Brinker TJ

Reducing the Impact of Confounding Factors on Skin Cancer Classification via Image Segmentation: Technical Model Study

J Med Internet Res 2021;23(3):e21695

DOI: 10.2196/21695

PMID: 33764307

PMCID: 8074854

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.