Accepted for/Published in: JMIR Medical Education
Date Submitted: Nov 6, 2023
Date Accepted: Apr 22, 2024
The Performance of ChatGPT-4V in the Japanese Medical Licensing Exam: Image and Table Insights: Research Letter
ABSTRACT
The recent introduction of Chat Generative Pre-trained Transformer 4 Vision (ChatGPT-4V) has expanded the capabilities of language models to include image input features, potentially broadening their application in the medical field. This Research Letter evaluates the performance of ChatGPT-4V in interpreting clinical images and tables through the Japanese Medical Licensing Exam (JMLE). Employing the September 25, 2023, version of ChatGPT-4V, the study compared the program's responses to the 117th JMLE against the passing criteria and the average scores of human examinees. While ChatGPT-4V surpassed the passing threshold with an 85.1% correct response rate in essential knowledge and 76.5% in other areas, it fell short in image-based (71.9%) and table-based questions (35.0%), indicating a significant gap compared to human performance. This suggests limitations in the model's image and table interpretation, exacerbated by its lower proficiency in non-Latin characters and potential overreliance on text information. Despite its success in passing the JMLE, the study highlights the need for further development of ChatGPT-4V to enhance its reliability for medical applications, including diagnostics.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.