Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Apr 28, 2025
Date Accepted: Jun 23, 2025
A scoping review of generative large language models in stroke care: current landscape and future directions
ABSTRACT
Background:
Stroke has a major impact on global health, causing long-term disability and straining healthcare resources. Generative large language models (gLLMs) have emerged as promising tools to help address these challenges, but their applications and reported performance in stroke care require comprehensive mapping and synthesis.
Objective:
The aim of this scoping review was to consolidate a fragmented evidence base and to examine the current landscape, shortcomings, and future directions in the design, reporting, and evaluation of gLLM-based interventions in stroke care.
Methods:
This scoping review, adhering to Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) guidelines and the Population-Concept-Context (PCC) framework, searched six major scientific databases in December 2024 to evaluate gLLM interventions across the stroke care pathway mapping their key characteristics and outcomes.
Results:
Retrospective designs predominated. Key applications of gLLMs included clinical decision-making support (40%), administrative assistance (36%), direct patient interaction (20%), and automated literature review (4%). Implementations mainly employed Generative Pretrained Transformer (GPT) models accessed through task-prompted chat interfaces. Five key challenges were identified from the included studies during the implementation of gLLM-based interventions: ensuring factual alignment, maintaining system robustness, enhancing interpretability, optimizing efficiency, and facilitating clinical adoption.
Conclusions:
The application of gLLMs in stroke care, while promising, remains relatively new, with most interventions reflecting early-stage or relatively simple implementations. Against this backdrop, critical gaps in research and clinical translation persist. To support the development of clinically impactful and trustworthy applications, we propose an actionable framework that prioritizes real-world evidence, mandates transparent technical reporting, broadens evaluation beyond output accuracy, strengthens validation of advanced task adaptation strategies, and investigates mechanisms for safe and effective human–gLLM interaction. Clinical Trial: The review protocol was pre-registered on the Open Science Framework and is available at: https://doi.org/10.17605/OSF.IO/J36WV.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.