Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Jun 22, 2020
Date Accepted: Feb 8, 2021
Reducing Confounding Factors’ Impact on Skin Cancer Classification via Image Segmentation: Technical Model Study
ABSTRACT
Background:
Studies have shown that artificial intelligence achieves similar or better performance than dermatologists in specific dermoscopic image classification tasks. It is however susceptible to the influence of confounding factors within images (e.g. skin markings), which can lead to false diagnoses of cancerous skin lesions. Image segmentation can remove lesion-adjacent confounding factors but greatly changes the image’s representation.
Objective:
Compare the performance of binary skin lesion classifiers trained on segmented or unsegmented dermoscopic images and determine if and how the segmentation process affects their skin lesion classification performance.
Methods:
Separate binary skin lesion classifiers (nevus vs. melanoma) were trained and evaluated on segmented and unsegmented dermoscopic images. For a more informative result, classifiers were trained on two distinct training datasets (HAM and ISIC). Each training run was repeated five times, resulting in an ensemble of classifiers for each training dataset and preprocessing method. Mean ensemble performance was evaluated on a multi-source test set (n=688) consisting of a holdout and an external component.
Results:
Segmented HAM classifiers showed a higher mean balanced accuracy of 75.6% ±1.1% compared to 66.7% ±3.2% (unsegmented), which was significant in four out of five runs (P<.001). ISIC classifiers showed a comparable mean balanced accuracy of 77.4% ±1.5% (segmented) and 78.3% ±1.8% (unsegmented), which was significantly different in one out of five runs (P=.004).
Conclusions:
Image segmentation does not result in an overall performance decrease and additionally causes the beneficial removal of lesion-adjacent confounding factors. Thus it is a viable option to address the negative impact confounding factors have on DL-models in dermatology. However, the segmentation step might introduce new pitfalls, which requires future investigation.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.