Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Sep 29, 2024
Date Accepted: Feb 5, 2025

The final, peer-reviewed published version of this preprint can be found here:

Stroke Diagnosis and Prediction Tool Using ChatGLM: Development and Validation Study

Song X, Wang J, He F, Yin W, Ma W, Wu J

Stroke Diagnosis and Prediction Tool Using ChatGLM: Development and Validation Study

J Med Internet Res 2025;27:e67010

DOI: 10.2196/67010

PMID: 40009850

PMCID: 11904371

A Stroke Diagnosis and Prediction Tool using ChatGLM: Development and Validation Study

  • Xiaowei Song; 
  • Jiayi Wang; 
  • Feifei He; 
  • Wei Yin; 
  • Weizhi Ma; 
  • Jian Wu

ABSTRACT

Background:

Stroke is a globally prevalent disease that imposes a significant burden on healthcare systems and national economies. Accurate and rapid stroke diagnosis can substantially increase reperfusion rates, mitigate disability, and reduce mortality. However, there are considerable discrepancies in the diagnosis and treatment of acute stroke.

Objective:

The aim of this study is to develop a stroke diagnosis and prediction tool based on ChatGLM3-6B, which utilizes free-text information from electronic health records (EHR) in conjunction with non-contrast computed tomography (NCCT) to enhance stroke detection and treatment.

Methods:

We utilized the free-text information from electronic health records (EHR) in conjunction with non-contrast computed tomography (NCCT) to enhance the detection and treatment of strokes. A total of 1,885 subjects, both stroke and non-stroke patients, were randomly selected from the neurology emergency room at a comprehensive stroke center to serve as our training set. We developed a large language model (LLM) based on ChatGLM3-6B by identifying optimal input combinations, employing external tools, and applying Instruction Tuning and Low-Rank Adaptation (LoRA) techniques. These strategies were implemented to improve the performance of critical procedures in the stroke diagnosis flowchart, and the results were subsequently validated using both internal and external datasets.

Results:

The multimodal LLM, which is based on clinical notes and NCCT, demonstrates exceptionally high accuracy in stroke diagnosis, achieving 99.0% in the internal validation dataset and 95.5% and 79.1% in two external test cohorts. It effectively distinguishes between ischemia and hemorrhage, with an accuracy of 100.0% in the validation dataset and 99.1% and 97.1% in the other test cohorts. Additionally, it identifies large vessel occlusions (LVO) with an accuracy of 80.0% in the validation dataset and 88.6% and 83.3% in the other test cohorts. Furthermore, it screens patients eligible for intravenous thrombolysis (IVT) with an accuracy of 89.4% in the validation dataset and 60.0% and 80.0% in the other test cohorts.

Conclusions:

We developed a large language model (LLM) that leverages clinical text and non-contrast computed tomography (NCCT) to identify strokes and guide recanalization therapy. While our results necessitate validation through widespread deployment, they hold the potential to enhance stroke identification and reduce reperfusion time.


 Citation

Please cite as:

Song X, Wang J, He F, Yin W, Ma W, Wu J

Stroke Diagnosis and Prediction Tool Using ChatGLM: Development and Validation Study

J Med Internet Res 2025;27:e67010

DOI: 10.2196/67010

PMID: 40009850

PMCID: 11904371

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.