JMIR Preprints #74040: Evaluating Artificial Intelligence Models in Dermatology: A Comparative Analysis

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Evaluating Artificial Intelligence Models in Dermatology: A Comparative Analysis

Aneri B Patel;
William Driscoll;
Conan H Lee;
Cameron Zachary;
Nicole M Golbari;
Janellen Smith

ABSTRACT

Background:

Dermatology is an ever-evolving field; the integration of artificial intelligence (AI) offers promising avenues for enhancing clinical decision-making and improving patient outcomes. This study focuses on evaluating the effectiveness of DermGPT, an AI model specifically designed to address dermatological inquiries with precision, compared to ChatGPT-4o (OpenAI), an AI chatbot and virtual assistant developed by OpenAI. DermGPT was developed by a practicing dermatologist, and it is marketed as a superior alternative to general-purpose AI models like ChatGPT; it contains a curated database of authoritative sources tailored to the community of dermatological professionals.

Objective:

To assess the validity of these claims, we conducted a comparative analysis between ChatGPT and DermGPT.

Methods:

We conducted a survey study. A series of 15 dermatology-specific questions were presented to both AI tools, with the responses subsequently evaluated by a cohort of 19+ practicing dermatologists (attendings, residents and fellows) across two institutions. The dermatologists, who were blinded to the origin of the answers, rated the quality and accuracy of the responses in the form of a survey. Preferences for answers and comparison of sources references were evaluated, and the efficacy of the sources of data were compared. Statistical analysis was performed using SAS OnDemand. Univariable analysis was performed for categorical variables via chi-square tests, with statistical significance set at p<0.05.

Results:

A total of 19 attendings, dermatology residents and fellows completed the survey. For answer preference, DermGPT responses were favored overall (48%) compared to ChatGPT (28%), with a statistically significant chi-square test result (p=0.039). In subgroup analysis, attendings preferred DermGPT (47%) over ChatGPT (28%), while residents showed a narrower preference margin (32% vs. 31%). For source preference, ChatGPT citations were favored (46%) over DermGPT (24%), though the chi-square test did not reach statistical significance (p=1.385). Attendings preferred ChatGPT sources (48%) over DermGPT (23%), while residents also favored ChatGPT (41%) over DermGPT (24%).

Conclusions:

DermGPT demonstrated strong potential for improving answer clarity and conciseness in dermatology-related queries, while ChatGPT provided more robust source citations, enhancing trust in evidence-based responses. Notably, ChatGPT’s citation quality was preferred, whereas DermGPT’s concise style made it more accessible for quick clinical reference. These findings suggest that integrating the strengths of both models could optimize AI-assisted medical consultations, balancing clarity with academic rigor. Clinical Trial: N/A

Citation

Please cite as:

Patel AB, Driscoll W, Lee CH, Zachary C, Golbari NM, Smith J

Evaluating Artificial Intelligence Models in Dermatology: Comparative Analysis

JMIR Dermatol 2025;8:e74040

DOI: 10.2196/74040

PMID: 41344880

PMCID: 12677980

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Dermatology

Date Submitted: Mar 16, 2025

Date Accepted: Nov 2, 2025

Evaluating Artificial Intelligence Models in Dermatology: A Comparative Analysis

ABSTRACT

Citation

Copyright