JMIR Preprints #85091: Performance of Large Language Models versus Traditional Chinese Doctors in Migraine Diagnosis and Herbal Prescription: Approaching a Turing Point

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Performance of Large Language Models versus Traditional Chinese Doctors in Migraine Diagnosis and Herbal Prescription: Approaching a Turing Point

Keming Yan;
Yuanyuan Li;
Yu Liu;
Kerun Li;
Jiani Wu;
Jian Kong

ABSTRACT

Background:

Although Traditional Chinese Medicine (TCM) is gaining global recognition, the availability of highly trained experts remains limited. The individualized and dynamically complex nature of TCM herbal formulations presents challenges unmatched in Western medicine.

Objective:

This study evaluates whether advanced, publicly accessible large language models (LLMs) can approximate the diagnostic reasoning and herbal prescription practices of experienced TCM doctors, using a standardized migraine case from a published report.

Methods:

Nine LLMs (eight reasoning-enabled “thinking” models and one non-thinking baseline) were prompted through publicly available interfaces using a structured five-task format (Western diagnosis, TCM diagnosis, treatment principle, herbal prescription, and preventive care). No fine-tuning or external knowledge bases were applied. For comparison, diagnoses and prescriptions from three TCM doctors, including the original case author, were generated using the same prompt. Thirty-two expert raters, blinded to response source, independently scored all outputs across five evaluation dimensions.

Results:

Most reasoning-enabled LLMs achieved performance comparable to that of senior TCM physicians, with several models (e.g., GPT-o3, Qwen-3, Gemini-2.5 Pro) receiving significantly higher scores than one or more physicians in specific tasks. By contrast, the non-thinking baseline model (Aya Expense 32B) performed substantially worse.

Conclusions:

Advanced LLMs may have reached a “Turing point” in TCM, rivaling expert performance. While promising for addressing practitioner shortages, these advances highlight the urgent need for careful regulation and oversight.

Citation

Please cite as:

Yan K, Li Y, Liu Y, Li K, Wu J, Kong J

Performance of Large Language Models versus Traditional Chinese Doctors in Migraine Diagnosis and Herbal Prescription: Approaching a Turing Point

JMIR Preprints. 30/09/2025:85091

DOI: 10.2196/preprints.85091

URL: https://preprints.jmir.org/preprint/85091

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Previously submitted to: Journal of Medical Internet Research (no longer under consideration since Mar 30, 2026)

Date Submitted: Sep 30, 2025

Open Peer Review Period: Oct 1, 2025 - Nov 26, 2025

(closed for review but you can still tweet)

NOTE: This is an unreviewed Preprint

Performance of Large Language Models versus Traditional Chinese Doctors in Migraine Diagnosis and Herbal Prescription: Approaching a Turing Point

ABSTRACT

Citation

Copyright