JMIR Preprints #88567: Voice-Based Structured Nursing Documentation Using Speech Recognition and Large Language Models: Development and Evaluation Study

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Voice-Based Structured Nursing Documentation Using Speech Recognition and Large Language Models: Development and Evaluation Study

Meng-Han Su;
Wei-Chun Wang;
Yi-Min Hsh;
Shih-Yen Hou;
Su-Jung Chuang;
Shih-Sheng Chang

ABSTRACT

Background:

For clinical nurses, manually entering information into hospital information systems (HIS) remains time consuming and prone to omissions. Although speech recognition can reduce the need for manual entry, its use in clinical settings has historically been limited by code switching, medical terminology, and noisy ward environments. Recent advances in customized automatic speech recognition (ASR) and large language models (LLMs) now make speech based, structured documentation aligned with nursing frameworks such as D.A.R.T. (DaThis study developed and evaluated an integrated ASR and LLM system that transforms spoken nursing input into structured D.A.R.T. notes, evaluating its accuracy, usability, and clinical feasibility within HIS workflows.ta, Action, Response, Teaching) increasingly feasible.

Objective:

This study developed and evaluated an integrated ASR and LLM system that transforms spoken nursing input into structured D.A.R.T. notes, evaluating its accuracy, usability, and clinical feasibility within HIS workflows.

Methods:

A code-switching nursing speech corpus from emergency and ward settings was used to fine tune the Whisper Large v2 model with parameter efficient adaptation[1]. The LLM generated schema constrained D.A.R.T. records from ASR transcripts, which were verified by nurses before being uploaded to the corresponding HIS fields. Evaluation included mixed error rate (MER) for ASR accuracy, F1 scores and agreement statistics for D.A.R.T. classification, hallucination assessments based on factual correctness, and analysis of nurse feedback on system use.

Results:

The fine tuned ASR model reduced the mixed error rate from 31.65% to 6.6%. D.A.R.T. generation achieved an F1 score of 0.82 and met the non-inferiority margin relative to human transcripts (Δ = −0.04).The hallucination rate was 2.51%. During deployment, monthly documentation volumes increased from 32,724 to 65,417, and 76% of the 120 participating nurses reported reduced workload and improved completeness.

Conclusions:

The integrated ASR and LLM system was feasible and showed strong performance, with good acceptance among clinical nurses. It reduced manual documentation burden, improved record completeness, and demonstrated the value of an ASR and LLM supported workflow for nursing documentation.

Citation

Please cite as:

Su MH, Wang WC, Hsh YM, Hou SY, Chuang SJ, Chang SS

Voice-Based Structured Nursing Documentation Using Automatic Speech Recognition and Large Language Models: Development and Evaluation Study

JMIR Nursing 2026;9:e88567

DOI: 10.2196/88567

PMID: 42247576

PMCID: 13240795

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Nursing

Date Submitted: Nov 27, 2025

Date Accepted: May 4, 2026

Voice-Based Structured Nursing Documentation Using Speech Recognition and Large Language Models: Development and Evaluation Study

ABSTRACT

Citation

Copyright