Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Nursing

Date Submitted: Nov 27, 2025
Date Accepted: May 4, 2026

The final, peer-reviewed published version of this preprint can be found here:

Voice-Based Structured Nursing Documentation Using Automatic Speech Recognition and Large Language Models: Development and Evaluation Study

Su MH, Wang WC, Hsh YM, Hou SY, Chuang SJ, Chang SS

Voice-Based Structured Nursing Documentation Using Automatic Speech Recognition and Large Language Models: Development and Evaluation Study

JMIR Nursing 2026;9:e88567

DOI: 10.2196/88567

PMID: 42247576

Voice-Based Structured Nursing Documentation Using Speech Recognition and Large Language Models: Development and Evaluation Study

  • Meng-Han Su; 
  • Wei-Chun Wang; 
  • Yi-Min Hsh; 
  • Shih-Yen Hou; 
  • Su-Jung Chuang; 
  • Shih-Sheng Chang

ABSTRACT

Background:

For clinical nurses, manually entering information into hospital information systems (HIS) remains time consuming and prone to omissions. Although speech recognition can reduce the need for manual entry, its use in clinical settings has historically been limited by code switching, medical terminology, and noisy ward environments. Recent advances in customized automatic speech recognition (ASR) and large language models (LLMs) now make speech based, structured documentation aligned with nursing frameworks such as D.A.R.T. (DaThis study developed and evaluated an integrated ASR and LLM system that transforms spoken nursing input into structured D.A.R.T. notes, evaluating its accuracy, usability, and clinical feasibility within HIS workflows.ta, Action, Response, Teaching) increasingly feasible.

Objective:

This study developed and evaluated an integrated ASR and LLM system that transforms spoken nursing input into structured D.A.R.T. notes, evaluating its accuracy, usability, and clinical feasibility within HIS workflows.

Methods:

A code-switching nursing speech corpus from emergency and ward settings was used to fine tune the Whisper Large v2 model with parameter efficient adaptation[1]. The LLM generated schema constrained D.A.R.T. records from ASR transcripts, which were verified by nurses before being uploaded to the corresponding HIS fields. Evaluation included mixed error rate (MER) for ASR accuracy, F1 scores and agreement statistics for D.A.R.T. classification, hallucination assessments based on factual correctness, and analysis of nurse feedback on system use.

Results:

The fine tuned ASR model reduced the mixed error rate from 31.65% to 6.6%. D.A.R.T. generation achieved an F1 score of 0.82 and met the non-inferiority margin relative to human transcripts (Δ = −0.04).The hallucination rate was 2.51%. During deployment, monthly documentation volumes increased from 32,724 to 65,417, and 76% of the 120 participating nurses reported reduced workload and improved completeness.

Conclusions:

The integrated ASR and LLM system was feasible and showed strong performance, with good acceptance among clinical nurses. It reduced manual documentation burden, improved record completeness, and demonstrated the value of an ASR and LLM supported workflow for nursing documentation.


 Citation

Please cite as:

Su MH, Wang WC, Hsh YM, Hou SY, Chuang SJ, Chang SS

Voice-Based Structured Nursing Documentation Using Automatic Speech Recognition and Large Language Models: Development and Evaluation Study

JMIR Nursing 2026;9:e88567

DOI: 10.2196/88567

PMID: 42247576

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.