Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Education

Date Submitted: Nov 4, 2025
Date Accepted: Dec 29, 2025

The final, peer-reviewed published version of this preprint can be found here:

Using AI to Train Future Clinicians in Depression Assessment: Feasibility Study

Holderried F, Sonanini A, Philipps A, Stegemann-Philipps C, Herschbach L, Festl-Wietek T, Zipfel S, Erschens R, Herrmann-Werner A

Using AI to Train Future Clinicians in Depression Assessment: Feasibility Study

JMIR Med Educ 2026;12:e87102

DOI: 10.2196/87102

PMID: 41678789

PMCID: 12946781

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Using AI to train future clinicians in depression assessment: a feasibility study.

  • Friederike Holderried; 
  • Alessandra Sonanini; 
  • Annika Philipps; 
  • Christian Stegemann-Philipps; 
  • Lea Herschbach; 
  • Teresa Festl-Wietek; 
  • Stephan Zipfel; 
  • Rebecca Erschens; 
  • Anne Herrmann-Werner

ABSTRACT

Background:

Depression is a major global healthcare challenge, causing significant individual distress but also contributing to a substantial global burden. Timely and accurate diagnosis is crucial. To help future clinicians develop these essential skills, we trained a GPT-powered chatbot to simulate patients with varying degrees of depression and suicidality.

Objective:

The Objective of this study is to evaluate the applicability and transferability of our GPT-4-powered chatbot for psychosomatic cases. Specifically, we aim to investigate how accurately the chatbot can simulate patients exhibiting various stages of depression and phases of suicidal ideation, while adhering to a predefined role script and maintaining a sufficient level of authenticity. Additionally, we want to analyze to what level the chatbot is suitable for practicing correctly diagnosing depressive disorders in patients as well as assessing suicidality stages.

Methods:

Medical students interacted with these virtual patients via chat. We assessed the perceived authenticity and role script adherence and analyzed chat transcripts to gain further insight into the chatbot's behavior and the students' diagnostic accuracy.

Results:

In over 90% of cases, the chatbot maintained its assigned role. On average, students correctly identified the severity of depression in 60% and the phase of suicidality in 67% of the cases. Notably, the majority either failed to address or insufficiently explored the topic of suicidality despite explicit instructions beforehand.

Conclusions:

This study demonstrates that a GPT-powered chatbot can reliably simulate depressive patients. However, we identified potential challenges in designing such bots. Students reported enjoying the training and perceived the conversations as rather authentic. However, their diagnostic accuracy varied depending on the severity of the case, highlighting the need for targeted training. AI-supported virtual patients provide a highly flexible, standardized, and readily available training tool, independent of real-life constraints.


 Citation

Please cite as:

Holderried F, Sonanini A, Philipps A, Stegemann-Philipps C, Herschbach L, Festl-Wietek T, Zipfel S, Erschens R, Herrmann-Werner A

Using AI to Train Future Clinicians in Depression Assessment: Feasibility Study

JMIR Med Educ 2026;12:e87102

DOI: 10.2196/87102

PMID: 41678789

PMCID: 12946781

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.