Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Jul 28, 2025
Date Accepted: Dec 17, 2025

The final, peer-reviewed published version of this preprint can be found here:

Large Language Models in Patient Health Communication for Atherosclerotic Cardiovascular Disease: Pilot Cross-Sectional Comparative Analysis

Li P, Xu Y, Liu X, Shen Z, Wang Y, Lv X, Lu Z, Wu H, Zhuang J, Chen Y

Large Language Models in Patient Health Communication for Atherosclerotic Cardiovascular Disease: Pilot Cross-Sectional Comparative Analysis

JMIR Med Inform 2026;14:e81422

DOI: 10.2196/81422

PMID: 41499171

PMCID: 12824577

Large Language Models in Patient Health Communication for Atherosclerotic Cardiovascular Disease: A Pilot Comparative Analysis

  • Pengfei Li; 
  • Yinfei Xu; 
  • Xiang Liu; 
  • Zhean Shen; 
  • Yi Wang; 
  • Xinyi Lv; 
  • Ziyi Lu; 
  • Hui Wu; 
  • Jiaqi Zhuang; 
  • Yan Chen

ABSTRACT

Background:

Large language models (LLMs) have emerged as promising tools for enhancing public access to medical information, particularly for chronic diseases such as atherosclerotic cardiovascular disease (ASCVD). However, their effectiveness in patient-centered health communication remains underexplored, especially in multilingual contexts.

Objective:

This study aimed to compare the performance of state-of-the-art LLMs in generating reliable health information for ASCVD patients across English and Chinese, to evaluate their suitability for public health communication and patient education.

Methods:

We evaluated the performance of three advanced LLMs—Deepseek-R1, ChatGPT-4o, and Google Gemini—in generating accurate, complete, and comprehensible responses to 25 open-ended ASCVD-related questions in both English and Chinese. Each question was submitted five times per model per language, yielding 750 responses. Responses were blindly assessed by three board-certified cardiologists using standardized criteria.

Results:

Deepseek-R1 achieved the highest "Good response" rates (96% in both English and Chinese), outperforming ChatGPT-4o (84%) and Gemini (48% in English, 68% in Chinese). It demonstrated significantly superior accuracy and completeness across both languages (p < 0.01), while all models performed comparably in comprehensibility. No significant differences were observed between English and Chinese performance for Deepseek-R1 and ChatGPT-4o; Gemini showed greater variability but without statistical significance. Diagnostic and definitional questions yielded stronger responses than treatment or prevention topics.

Conclusions:

Deepseek-R1 showed robust and consistent performance in patient-facing ASCVD queries across languages, highlighting the potential of open-source LLMs in promoting digital health literacy and equitable access to chronic disease information. These findings support the cautious integration of LLMs into public health communication strategies, alongside expert oversight and continued validation.


 Citation

Please cite as:

Li P, Xu Y, Liu X, Shen Z, Wang Y, Lv X, Lu Z, Wu H, Zhuang J, Chen Y

Large Language Models in Patient Health Communication for Atherosclerotic Cardiovascular Disease: Pilot Cross-Sectional Comparative Analysis

JMIR Med Inform 2026;14:e81422

DOI: 10.2196/81422

PMID: 41499171

PMCID: 12824577

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.