Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Education

Date Submitted: Dec 19, 2025
Date Accepted: Apr 29, 2026
Date Submitted to PubMed: Apr 29, 2026

The final, peer-reviewed published version of this preprint can be found here:

Ambient AI Scribes to Create Educational Feedback Notes for Medical Students: Randomized Trial

Talwalkar JS, Chartash D, Zhang L, Makutonin M, Safranek CW, Sidamon-Eristoff AE, Schwamm LH, Wright DS

Ambient AI Scribes to Create Educational Feedback Notes for Medical Students: Randomized Trial

JMIR Med Educ 2026;12:e89996

DOI: 10.2196/89996

PMID: 42208058

Ambient AI Scribes to Create Educational Feedback Notes for Medical Students: A Randomized Trial

  • Jaideep S Talwalkar; 
  • David Chartash; 
  • Lisa Zhang; 
  • Michael Makutonin; 
  • Conrad W Safranek; 
  • Anne Elizabeth Sidamon-Eristoff; 
  • Lee H Schwamm; 
  • Donald S Wright

ABSTRACT

Background:

High-quality observation and feedback contribute to the development of clinical competence and professional growth in medical education. Faculty often struggle to translate verbal observations into written feedback because of documentation burden and competing demands. Ambient artificial intelligence (AI) scribes, already adopted in clinical practice, may address this challenge by capturing verbal exchanges and generating structured notes.

Objective:

The purpose of this study was to examine the use of ambient AI scribes to generate educational feedback notes during a formative medical interviewing workshop for first-year medical students in spring 2025.

Methods:

Thirteen instructors were randomized to control (human-only) or intervention (AI scribe-assisted) workflows to complete narrative feedback forms. The intervention group used an AI scribe to generate transcripts of student-instructor encounters, which were then summarized into feedback notes using a large language model, and edited by instructors before submission. All narratives were scored using the Evaluation of Feedback Captured Tool (EFeCT). Factual accuracy of a subsample of unedited AI feedback summaries was reviewed against source transcripts. Task load and usability were measured using NASA Task Load Index and System Usability Scale respectively.

Results:

Instructors submitted feedback on 94 of 102 students (92.2%). Median EFeCT scores on the zero-to-five scale were higher for human-edited AI narratives (3.00) and unedited AI summaries (3.00) compared to human-only narratives (2.00; p<0.001). Human narratives were shorter than AI-assisted outputs (p<0.001). Review of 117 AI-generated feedback elements showed a 6.8% mischaracterization and 1.7% hallucination rate, with most errors corrected during editing. Task load was high and usability marginal in both control and intervention groups, with no significant differences.

Conclusions:

An ambient AI scribe-assisted workflow improved the quality of written narrative feedback with no observed increase in instructor effort compared to human-only documentation. Although occasional inaccuracies required review, this innovation has the potential to transform feedback documentation. Clinical Trial: This study received exemption from review by the Yale University Institutional Review Board due to its educational nature on January 23, 2025 (IRB Protocol ID 2000039478).


 Citation

Please cite as:

Talwalkar JS, Chartash D, Zhang L, Makutonin M, Safranek CW, Sidamon-Eristoff AE, Schwamm LH, Wright DS

Ambient AI Scribes to Create Educational Feedback Notes for Medical Students: Randomized Trial

JMIR Med Educ 2026;12:e89996

DOI: 10.2196/89996

PMID: 42208058

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.