
Accepted for/Published in: JMIR Medical Education

Date Submitted: Mar 27, 2024
Date Accepted: Nov 9, 2024

The final, peer-reviewed published version of this preprint can be found here:

Performance of ChatGPT-4 on Taiwanese Traditional Chinese Medicine Licensing Examinations: Cross-Sectional Study

Tseng LW, Lu YC, Tseng LC, Chen HY, Chen YC

Performance of ChatGPT-4 on Taiwanese Traditional Chinese Medicine Licensing Examinations: Cross-Sectional Study

JMIR Med Educ 2025;11:e58897

DOI: 10.2196/58897

PMID: 40106227

PMCID: 11939018

Can ChatGPT-4 Pass the Licensing Examinations of Traditional Chinese Medicine? A Cross-sectional Study in Taiwan

  • Liang-Wei Tseng; 
  • Yi-Chin Lu; 
  • Liang-Chi Tseng; 
  • Hsing-Yu Chen; 
  • Yu-Chun Chen

ABSTRACT

Background:

The integration of artificial intelligence (AI), notably ChatGPT (Chat Generative Pre-trained Transformer), into medical education has shown promising results in various medical fields. Nevertheless, its efficacy on Traditional Chinese Medicine (TCM) examinations remains understudied.

Objective:

This study aims to (1) assess the performance of ChatGPT on the TCM licensing examination in Taiwan and (2) explore its potential as a learning tool and its understanding of TCM principles.

Methods:

We used the GPT-4 model to respond to 480 questions from the 2022 TCM licensing examination. This study compared the model's performance against that of licensed TCM doctors using two approaches: direct answer selection and provision of an explanation before answer selection. The accuracy and consistency of the AI-generated responses were analyzed. Moreover, question characteristics were broken down by cognitive level, depth of knowledge, question type, vignette style, and question polarity.

Results:

ChatGPT achieved an overall accuracy of 43.9%, lower than that of the two human participants (70% and 78.4%). The analysis did not reveal a significant correlation between the model's accuracy and question characteristics. An in-depth examination indicated that errors predominantly resulted from misunderstandings of TCM concepts (55.3%), underscoring the limitations of the model's TCM knowledge base and reasoning capability.

Conclusions:

While ChatGPT shows promise as an educational tool, its current performance on TCM licensing examinations falls short. This highlights the need to enhance AI models with specialized TCM training and suggests a cautious approach to using AI for TCM education. Future research should focus on model improvement and the development of tailored educational applications to support TCM learning.



© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). The authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC-BY license on publication, at this stage the authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.