JMIR Preprints #72034: Assessment of Large Language Model Performance on Medical School Essay-Style Concept Appraisal Questions

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Assessment of Large Language Model Performance on Medical School Essay-Style Concept Appraisal Questions

Seysha Mehta;
Eliot Haddad;
Indira Bhavsar Burke;
Alana Majors;
Rie Maeda;
Sean M Burke;
Abhishek Deshpande;
Amy Nowacki;
Christina Lindenmeyer;
Neil Mehta

ABSTRACT

Microsoft Copilot, a ChatGPT 4.0 based Large Language Model, demonstrated comparable performance to medical students in answering essay-style CAPPs, while assessors struggled to differentiate AI from human responses. These results highlight the need to prepare students and educators for a future world of AI by fostering reflective learning practices and critical thinking.

Citation

Please cite as:

Mehta S, Haddad E, Burke IB, Majors A, Maeda R, Burke SM, Deshpande A, Nowacki A, Lindenmeyer C, Mehta N

Assessment of Large Language Model Performance on Medical School Essay-Style Concept Appraisal Questions: Exploratory Study

JMIR Med Educ 2025;11:e72034

DOI: 10.2196/72034

PMID: 40523238

PMCID: 12208947

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Education

Date Submitted: Feb 2, 2025

Date Accepted: May 16, 2025

Assessment of Large Language Model Performance on Medical School Essay-Style Concept Appraisal Questions

ABSTRACT

Citation

Copyright