Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Nursing

Date Submitted: Feb 28, 2022
Date Accepted: Sep 25, 2022

The final, peer-reviewed published version of this preprint can be found here:

Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation

Hou SY, Chen KC, Chang TA, Hsu YM, Chuang SJ, Chang Y, Wu YL, Hsu KC

Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation

JMIR Nursing 2022;5(1):e37562

DOI: 10.2196/37562

PMID: 36476781

PMCID: 9773023

Code-Switching Automatic Speech Recognition for Nursing Record Documentation: Development and Evaluation

  • Shih-Yen Hou; 
  • Kai-Ching Chen; 
  • Ting-An Chang; 
  • Yi-Min Hsu; 
  • Su-Jung Chuang; 
  • Ying Chang; 
  • Ya-Lun Wu; 
  • Kai-Cheng Hsu

ABSTRACT

Background:

Taiwan has insufficient nursing resources due to the high turnover rate of nursing personnel. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of only one speaker facilitates data collection and system development. Moreover, authorization from patients is unnecessary.

Objective:

A speech recognition system for nursing records was constructed such that medical personnel can complete nursing records without typing or with only a few edits.

Methods:

Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching (CS) information. Next, transfer learning (TL) and meta-transfer learning (MTL) methods, which perform favorably in CS scenarios, were applied.

Results:

The word error rate (WER) of the benchmark model of syllables-based TL and the proposed model of syllables-based MTL was 29.54% and 22.20% WER in code-switching, respectively. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllables-based MTL yielded a WER of 31.06% WER in code-switching. The clinical test set contained 1,159 words.

Conclusions:

Medical personnel in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Therefore, a Mandarin–English CS speech recognition system for nursing documentation was developed. The proposed data set has two characteristics, namely the medical field and CS, and lightens the workload of medical personnel.


 Citation

Please cite as:

Hou SY, Chen KC, Chang TA, Hsu YM, Chuang SJ, Chang Y, Wu YL, Hsu KC

Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation

JMIR Nursing 2022;5(1):e37562

DOI: 10.2196/37562

PMID: 36476781

PMCID: 9773023

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.