JMIR Preprints #36885: Race/Ethnicity Stratified Analysis of an Artificial Intelligence Based Tool for Skin Condition Diagnosis by Primary Care Physicians and Nurse Practitioners

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)

Race/Ethnicity Stratified Analysis of an Artificial Intelligence Based Tool for Skin Condition Diagnosis by Primary Care Physicians and Nurse Practitioners

Ayush Jain;
David Way;
Vishakha Gupta;
Yi Gao;
Guilherme de Oliveira Marinho;
Jay Hartford;
Rory Sayres;
Kimberley Kanada;
Clara Eng;
Kunal Nagpal;
Karen B. De Salvo;
Greg S. Corrado;
Lily Peng;
Dale R. Webster;
R. Carter Dunn;
David Coz;
Susan J. Huang;
Yun Liu;
Peggy Bui;
Yuan Liu

ABSTRACT

Background:

Many dermatologic cases are first evaluated by primary care physicians (PCPs) or nurse practitioners (NPs).

Objective:

To evaluate an artificial intelligence (AI)-based tool that assists with interpreting dermatologic conditions.

Methods:

We developed an AI-based tool and conducted a randomized multi-reader, multi-case study (20 PCPs, 20 NPs, 1047 retrospective teledermatology cases) to evaluate its utility. Cases were enriched and comprised 120 skin conditions. Readers were recruited to optimize for geographical diversity; the PCPs practiced across 12 states (2-32 years of experience, mean: 11.3) and the NPs practiced across 9 states (2-34 years of experience, mean: 13.1). To avoid memory effects from incomplete washout, each case was read once by each clinician: either with or without AI assistance, with the assignment randomized. The primary analyses evaluated the top-1 agreement, defined as the agreement rate of the clinicians’ primary diagnosis with the reference diagnoses provided by a panel of dermatologists (per case: 3 dermatologists from a pool of 12, practicing across 8 states, 5-13 years of experience [mean: 7.2]). We additionally conducted subgroup analyses stratified by cases’ self-reported race/ethnicity, and measured the performance spread: the maximum performance subtracted by the minimum across subgroups.

Results:

The AI’s standalone top-1 agreement was 63% and AI assistance was significantly associated with higher agreement with reference diagnoses (Figure). For PCPs, the increase in diagnostic agreement was 10% (p<0.001), from 48% to 58%; for NPs, the increase was 12% (p<0.001), from 46% to 58%. When stratified by cases’ self-reported race/ethnicity (Figure), the AI’s performance was 59-62% for Asian / Native Hawaiian / Pacific Islander, Other, and Hispanic / Latino, and 67% for both Black / African American and White subgroups. For the clinicians, AI-assistance associated improvements across subgroups were in the range of 8-12% for PCPs and 8-15% for NPs. The performance spread across subgroups was 5.3% unassisted vs. 6.6% assisted for PCPs, and 5.2% unassisted vs. 6.0% assisted for NPs. In both unassisted and AI-assisted modalities, and for both PCPs and NPs, the subgroup with the highest performance on average was Black / African Americans, though the differences with other subgroups were small and had overlapping confidence intervals.

Conclusions:

AI assistance was associated with significantly improved diagnostic agreement with dermatologists. Across race/ethnicity subgroups, for both PCPs and NPs, the effect of AI assistance remained high at 8-15% and the performance spread was similar at 5-7%.

Citation

Please cite as:

Jain A, Way D, Gupta V, Gao Y, de Oliveira Marinho G, Hartford J, Sayres R, Kanada K, Eng C, Nagpal K, De Salvo KB, Corrado GS, Peng L, Webster DR, Dunn RC, Coz D, Huang SJ, Liu Y, Bui P, Liu Y

Race- and Ethnicity-Stratified Analysis of an Artificial Intelligence–Based Tool for Skin Condition Diagnosis by Primary Care Physicians and Nurse Practitioners

Iproc 2022;8(1):e36885

DOI: 10.2196/36885

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Iproceedings

Date Submitted: Jan 28, 2022

Date Accepted: Jan 28, 2022

Race/Ethnicity Stratified Analysis of an Artificial Intelligence Based Tool for Skin Condition Diagnosis by Primary Care Physicians and Nurse Practitioners

ABSTRACT

Citation

Copyright

JMIR Preprints

Accepted for/Published in: Iproceedings

Date Submitted: Jan 28, 2022

Date Accepted: Jan 28, 2022

Race/Ethnicity Stratified Analysis of an Artificial Intelligence Based Tool for Skin Condition Diagnosis by Primary Care Physicians and Nurse Practitioners

ABSTRACT

Citation

Per the author's request the PDF is not available.

Copyright