JMIR Preprints #54283: The Performance of ChatGPT-4V in the Japanese Medical Licensing Exam: Image and Table Insights: Research Letter

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

The Performance of ChatGPT-4V in the Japanese Medical Licensing Exam: Image and Table Insights: Research Letter

Soshi Takagi;
Masahide Koda;
Takashi Watari

ABSTRACT

The recent introduction of Chat Generative Pre-trained Transformer 4 Vision (ChatGPT-4V) has expanded the capabilities of language models to include image input features, potentially broadening their application in the medical field. This Research Letter evaluates the performance of ChatGPT-4V in interpreting clinical images and tables through the Japanese Medical Licensing Exam (JMLE). Employing the September 25, 2023, version of ChatGPT-4V, the study compared the program's responses to the 117th JMLE against the passing criteria and the average scores of human examinees. While ChatGPT-4V surpassed the passing threshold with an 85.1% correct response rate in essential knowledge and 76.5% in other areas, it fell short in image-based (71.9%) and table-based questions (35.0%), indicating a significant gap compared to human performance. This suggests limitations in the model's image and table interpretation, exacerbated by its lower proficiency in non-Latin characters and potential overreliance on text information. Despite its success in passing the JMLE, the study highlights the need for further development of ChatGPT-4V to enhance its reliability for medical applications, including diagnostics.

Citation

Please cite as:

Takagi S, Koda M, Watari T

The Performance of ChatGPT-4V in Interpreting Images and Tables in the Japanese Medical Licensing Exam

JMIR Med Educ 2024;10:e54283

DOI: 10.2196/54283

PMID: 38787024

PMCID: 11148840

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Education

Date Submitted: Nov 6, 2023

Date Accepted: Apr 22, 2024

The Performance of ChatGPT-4V in the Japanese Medical Licensing Exam: Image and Table Insights: Research Letter

ABSTRACT

Citation

Copyright