Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Rehabilitation and Assistive Technologies

Date Submitted: Jun 14, 2025
Date Accepted: Dec 3, 2025

The final, peer-reviewed published version of this preprint can be found here:

Assessing the Role of Medical Caption Technology to Support Physician-Patient Communication for Patients With Hearing Loss: Mixed Methods Pilot Study

Hughes SE, Wu LY, Ma LJ, Jain D, McKee MM

Assessing the Role of Medical Caption Technology to Support Physician-Patient Communication for Patients With Hearing Loss: Mixed Methods Pilot Study

JMIR Rehabil Assist Technol 2026;13:e79073

DOI: 10.2196/79073

PMID: 41538699

PMCID: 12806592

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Assessing the Role of Medical Caption Technology to Support Physician-Patient Communication for Patients with Hearing Loss: A Pilot Study

  • Sarah E Hughes; 
  • Liang-Yuan Wu; 
  • Lindsay J Ma; 
  • Dhruv Jain; 
  • Michael M McKee

ABSTRACT

Background:

Speech recognition technology is widely used by D/deaf and hard-of-hearing (DHH) individuals in everyday communication, but its clinical applications remain underexplored. Communication barriers in healthcare can compromise safety, understanding, and autonomy for DHH individuals.

Objective:

This study evaluated a real-time speech recognition system (SRS) tailored for clinical settings, examining its usability, perceived effectiveness, and transcription accuracy among DHH users.

Methods:

We conducted a pilot study with 10 DHH adults participating in mock outpatient encounters using a custom SRS powered by Google’s speech-to-text API. Participants completed post-scenario surveys and structured exit interviews assessing distraction, trust, ease of use, satisfaction, and emotional response. Caption accuracy was benchmarked against professional Communication Access Realtime Translation (CART) transcripts using word error rate (WER).

Results:

Across 29 clinical scenario simulations, 86% of participants found captions non-distracting, 90% reported them easy to follow and trustworthy, and 76% were satisfied with the experience. Participants described the SRS as intuitive, emotionally grounding, and preferable to lipreading in masked settings. WER ranged from 12.7% to 22.8%, consistent with benchmarks for automated speech recognition systems. Interviews revealed themes of increased confidence in following clinical conversations and staying engaged despite masked communication, reduced anxieties about missing critical medical information, and strong interest in expanding the tool to real-world settings, especially for older adults or those with cognitive impairments.

Conclusions:

Our findings support the potential of real-time captioning to enhance accessibility and reduce the cognitive and mental burden of communication for DHH individuals in clinical care. Participants described the SRS as both functionally effective and personally empowering. While accuracy for complex medical terminology remains a limitation, participants consistently expressed trust in the system and a desire for its integration into clinical care. Future research should explore real-world implementation, domain-specific optimization, and the development of user-centered evaluation metrics that extend beyond transcription fidelity to include trust, autonomy, and communication equity. Clinical Trial: N/A


 Citation

Please cite as:

Hughes SE, Wu LY, Ma LJ, Jain D, McKee MM

Assessing the Role of Medical Caption Technology to Support Physician-Patient Communication for Patients With Hearing Loss: Mixed Methods Pilot Study

JMIR Rehabil Assist Technol 2026;13:e79073

DOI: 10.2196/79073

PMID: 41538699

PMCID: 12806592

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.