JMIR Preprints #80557: Improving large language model applications in medical and nursing with retrieval-augmented generation: A Scoping Review

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Improving large language model applications in medical and nursing with retrieval-augmented generation: A Scoping Review

Yiqun Miao;
Yuhan Zhao;
Yuan luo;
huiying wang;
ying wu

ABSTRACT

Background:

Retrieval-Augmented Generation (RAG) is increasingly applied in medicine and nursing to enhance large language models. However, there remains a lack of comprehensive understanding regarding its specific architecture and its application in medical reasoning.

Objective:

This study aims to investigate the current state and emerging trends of RAG in the medical and nursing domains.

Methods:

PubMed, Web of Science, IEEE Xplore, and arXiv were searched for relevant articles using queries that combine terms related to RAG, large language model, medical, and nursing. This review was conducted following the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews) guidelines.

Results:

A total of 917 articles were retrieved, of which 67 met the inclusion criteria. The RAG frameworks included in this review were categorized into five functional types: text-based RAG (n=36), knowledge graph-enhanced RAG (n=17), agentic RAG (n=6), multimodal RAG (n=2), and plug-and-play RAG (n=6). Based on a staged decomposition of the RAG workflow into intent recognition, knowledge retrieval, knowledge integration, and generation stage, we analyzed the specific techniques employed at each stage across the included studies. Despite the growing emphasis on reasoning, only 26 studies incorporated explicit reasoning mechanisms, and few aligned with the procedural logic of clinical or nursing workflows.

Conclusions:

This study revealed that recent advancements in medical and nursing RAG frameworks demonstrate four key transformations: from surface-level matching to contextualized intent recognition; from vague semantics to logic-driven dynamic retrieval; from passive to active knowledge retrieval; and from simple aggregation to coherent context construction. However, most RAG systems in medical and nursing have not yet introduced reasoning methods, and those that have are still predominantly reliant on statistical associations. This underscores the necessity of incorporating causal mechanisms to achieve more profound and domain-specific reasoning.

Citation

Please cite as:

Miao Y, Zhao Y, luo Y, wang h, wu y

Improving Large Language Model Applications in the Medical and Nursing Domains With Retrieval-Augmented Generation: Scoping Review

J Med Internet Res 2025;27:e80557

DOI: 10.2196/80557

PMID: 41118646

PMCID: 12587015

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 12, 2025

Date Accepted: Sep 5, 2025

Improving large language model applications in medical and nursing with retrieval-augmented generation: A Scoping Review

ABSTRACT

Citation

Copyright