Accepted for/Published in: JMIR AI
Date Submitted: Sep 16, 2024
Date Accepted: Jul 5, 2025
Date Submitted to PubMed: Jul 6, 2025
Deep Learning Multi Modal Melanoma Detection: Algorithm Development and Validation
ABSTRACT
The visual similarity of melanoma and seborrheic keratosis makes it difficult for elderly patients with disabilities to know when to seek medical attention, which contributes to the metastasis of melanoma. In this paper, we present a novel multi-modal deep learning-based technique to distinguish between melanoma and seborrheic keratosis. Our strategy is three-fold: (1) use patient image data to train and test three deep learning models built with transfer learning (ResNet50, InceptionV3, and VGG16) and one author-designed model; (2) use patient metadata to train and test a deep learning model; and (3) combine the predictions of the most accurate image model with those of the metadata model, using nonlinear least squares regression to assign optimal weights to each model for a combined prediction. The accuracy of the combined model was 88% on test data from the HAM10000 dataset. Model reliability was assessed by visualizing the output activation map of each model and comparing the diagnostic patterns with those of dermatologists. Adding metadata to the image dataset was key to reducing the false negative and false positive rates simultaneously, thereby improving overall model accuracy. Results from this experiment could be used to eliminate late diagnosis of melanoma through an easily accessible app; seeking early attention is vital to prevent metastasis. Future experiments can use text data (subjective data on how the patient felt over a certain period of time) so that the model reflects the real hospital setting more closely.
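Step (3) of the strategy, weighting the image model against the metadata model, can be illustrated with a minimal sketch. The abstract does not give the functional form or the solver, so everything here is an assumption: the combined prediction is taken to be `w * p_img + (1 - w) * p_meta`, the weight is parametrized as `w = sigmoid(a)` so that it stays between 0 and 1 (which makes the fit nonlinear in `a`), and a simple damped Gauss-Newton loop stands in for whatever nonlinear least squares routine the authors used. The names `fit_fusion_weight` and `fuse` and the toy probabilities are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def fuse(w, p_img, p_meta):
    """Combined melanoma probability from the two models (assumed form)."""
    return w * p_img + (1.0 - w) * p_meta


def fit_fusion_weight(p_img, p_meta, labels, iters=50, damping=1e-6):
    """Fit w in (0, 1) minimizing sum((fuse(w, p_img, p_meta) - label)^2).

    w = sigmoid(a); the free parameter a is updated with damped
    Gauss-Newton steps. This is an illustrative solver, not the
    authors' implementation.
    """
    a = 0.0  # start from equal weights (w = 0.5)
    for _ in range(iters):
        w = sigmoid(a)
        dw_da = w * (1.0 - w)
        # Residuals and their derivatives with respect to a.
        r = [fuse(w, pi, pm) - y for pi, pm, y in zip(p_img, p_meta, labels)]
        jac = [dw_da * (pi - pm) for pi, pm in zip(p_img, p_meta)]
        jtj = sum(j * j for j in jac) + damping
        jtr = sum(j * ri for j, ri in zip(jac, r))
        step = -jtr / jtj
        # Clamp a so exp() cannot overflow if the optimum lies at a boundary.
        a = max(-30.0, min(30.0, a + step))
        if abs(step) < 1e-10:
            break
    return sigmoid(a)


if __name__ == "__main__":
    # Made-up validation-set probabilities and labels, for illustration only.
    img_probs = [0.9, 0.2, 0.6, 0.4]
    meta_probs = [0.5, 0.4, 0.3, 0.8]
    targets = [1, 0, 1, 0]
    weight = fit_fusion_weight(img_probs, meta_probs, targets)
    print("image-model weight:", weight)
    print("fused prediction for first case:", fuse(weight, 0.9, 0.5))
```

In practice the weight would be fitted on held-out validation predictions rather than on the test set, so that the reported test accuracy is not influenced by the fusion step.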
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.