JMIR Preprints #105970: Large Language Models in Gastrointestinal Endoscopy: From Data Structuring to Clinical Decision-Making and Communication

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Large Language Models in Gastrointestinal Endoscopy: From Data Structuring to Clinical Decision-Making and Communication

Zhijie Jiang;
Angda Ji;
Yi Mou;
Bing Hu;
Xianglei Yuan

ABSTRACT

Large language models (LLMs) are rapidly being adopted to augment clinical workflows in gastrointestinal (GI) endoscopy, where vast multimodal data must be interpreted, documented, and translated into guideline-concordant management and patient communication. Early prototypes look promising, but the evidence comes from disparate study designs and evaluation methods that are hard to compare, leaving the real-world value of these systems unclear. In this Viewpoint, we argue that evaluating LLMs task by task obscures how they behave once embedded in the endoscopic process, and that a systems-level perspective is needed. We propose a pipeline-based conceptual framework that organizes LLM applications into four interconnected layers—data structuring, perception and interpretation, clinical decision-making, and patient communication—spanning the full path from raw data to patient interaction. Our key message is that performance is uneven across this pipeline: it is generally higher in text-centric tasks and degrades in complex multimodal reasoning and individualized decision support, and, critically, errors introduced upstream can propagate downstream to compromise clinical decisions and patient-facing outputs. Reading the pipeline as a whole, we surface the cross-layer risks and key barriers that isolated evaluations miss, and outline directions for integrated end-to-end evaluation, prospective real-world validation, stronger multimodal reasoning, and knowledge-grounded architectures. We advance this framework to guide rigorous assessment and the responsible translation of LLMs into routine GI endoscopic care.

Citation

Please cite as:

Jiang Z, Ji A, Mou Y, Hu B, Yuan X

Large Language Models in Gastrointestinal Endoscopy: From Data Structuring to Clinical Decision-Making and Communication

JMIR Preprints. 01/07/2026:105970

DOI: 10.2196/preprints.105970

URL: https://preprints.jmir.org/preprint/105970

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Currently submitted to: Journal of Medical Internet Research

Date Submitted: Jul 1, 2026

Open Peer Review Period: Jul 2, 2026 - Aug 27, 2026

(currently open for review)

Large Language Models in Gastrointestinal Endoscopy: From Data Structuring to Clinical Decision-Making and Communication

ABSTRACT

Citation

Copyright