JMIR Preprints #92867: Application of Multimodal Large Language Models in Cutaneous Lesion Recognition of Talaromycosis and Cryptococcosis

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Application of Multimodal Large Language Models in Cutaneous Lesion Recognition of Talaromycosis and Cryptococcosis

Yun Chen;
Shiyi Lai;
Huanlan Yang;
Meng Zhang;
Xiaoting Xie;
Weihong Huang;
Jinduan Wei;
Zongxiang Yuan;
Yuyuan Huang;
Junjun Jiang;
Li Ye;
Hao Liang;
Wudi Wei

ABSTRACT

Background:

Talaromycosis and cryptococcosis are prevalent in Southern China and Southeast Asia and are frequently misclassified due to overlapping lesion morphology and limited access to confirmatory testing.

Objective:

To evaluate the zero-shot diagnostic performance of multimodal large language models in identifying and differentiating cutaneous lesions of talaromycosis and cryptococcosis

Methods:

Published clinical photographs of cutaneous lesions of talaromycosis and cryptococcosis were systematically retrieved up to 31 August 2025, and seven representative multimodal large language models were benchmarked under a strictly zero-shot setting using a standardized prompt template and a predefined output schema. Latency, unanswerable/invalid response rates, and diagnostic performance were evaluated using accuracy, precision, sensitivity, specificity, F1-score, and Matthews correlation coefficient. For explanation quality assessment, model-generated texts were independently rated by two clinicians across five dimensions, and hallucination events were quantified.

Results:

In total, 214 articles (95 for talaromycosis and 119 for cryptococcosis), including 244 talaromycosis cutaneous lesion images and 236 cryptococcosis cutaneous lesion images, were collected for zero-shot evaluation. Most models achieved acceptable performance recognition, among them, ChatGPT-5 achieved the best performance. For comprehensive performance comparison, ChatGPT-5 ranked first across six indicators but exhibited relatively lower sensitivity. Evaluation of the output text quality demonstrated that the diagnostic texts generated by GPT-5 were excellent. The EQI was 70.08, with a hallucination rate of 21.76%.

Conclusions:

ChatGPT-5 demonstrates feasibility in the recognition of cutaneous lesions of talaromycosis and cryptococcosis under zero-shot conditions and can serve as a potential tool for assisting in the analysis of infectious skin disease images.

Citation

Please cite as:

Chen Y, Lai S, Yang H, Zhang M, Xie X, Huang W, Wei J, Yuan Z, Huang Y, Jiang J, Ye L, Liang H, Wei W

Application of Multimodal Large Language Models in Cutaneous Lesion Recognition of Talaromycosis and Cryptococcosis

JMIR Preprints. 04/02/2026:92867

DOI: 10.2196/preprints.92867

URL: https://preprints.jmir.org/preprint/92867

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Currently submitted to: Transfer Hub (manuscript eXchange)

Date Submitted: Feb 4, 2026

Open Peer Review Period: Feb 5, 2026 - Apr 2, 2026

(closed for review but you can still tweet)

NOTE: This is an unreviewed Preprint

Application of Multimodal Large Language Models in Cutaneous Lesion Recognition of Talaromycosis and Cryptococcosis

ABSTRACT

Citation

Copyright