Accepted for/Published in: JMIR Mental Health
Date Submitted: Jul 25, 2025
Open Peer Review Period: Jul 26, 2025 - Sep 20, 2025
Date Accepted: Sep 13, 2025
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Is ChatGPT ready for clinical use in mental healthcare? A scoping review of empirical evidence
ABSTRACT
Background:
As mental health challenges continue to rise globally, there is increasing interest in the use of generative pre-trained transformers (GPT) models, such as ChatGPT, in mental healthcare. A few months after its release, tens of thousands of users interacted with GPT-based therapy bots ,with mental health support identified as the primary use case. ChatGPT offers scalable and immediate support through natural language processing capabilities, but their clinical applicability, safety, and effectiveness remain underexplored.
Objective:
This scoping review aims to provide a comprehensive overview of the main clinical applications of ChatGPT in mental healthcare, along with the existing empirical evidence for its performance.
Methods:
A systematic search was conducted in 8 electronic databases through April 2025 to identify primary studies. Eligible studies included primary research, reporting on the evaluation of a ChatGPT clinical application implemented for a mental healthcare–specific purpose.
Results:
Sixty studies were included in this scoping review. The results highlighted that most applications used generic ChatGPT and focused on the detection of mental health problems and counselling and treatment, while only a minority of studies investigated ChatGPT use in clinical decision facilitation and prognosis tasks. Most of the studies were prompt experiments, in which standardized text inputs—designed to mimic clinical scenarios, patient descriptions, or practitioner queries—are submitted to ChatGPT to evaluate its performance in mental health-related tasks. In terms of performance, ChatGPT shows good accuracy in detecting specific mental health problems, simulating therapeutic conversation, providing psychoeducation, and conducting specific therapeutic strategies. However, ChatGPT has significant limitations with more complex clinical presentations and its overly pessimistic prognostic outputs. Nevertheless, overall, when compared to mental health experts or other artificial intelligence (AI) models, ChatGPT approximates or surpasses their performance in conducting various clinical tasks. Finally, custom ChatGPT use was associated with better performance, especially in counselling and treatment tasks.
Conclusions:
While ChatGPT offers promising capabilities for mental health screening, psychoeducation, and structured therapeutic interactions, its current limitations highlight the need for caution in clinical adoption. These limitations also underscore the need for rigorous evaluation frameworks, model refinement, and safety protocols before broader clinical integration. Moreover, the variability in performance across versions, tasks and diagnostic categories also invites a more nuanced reflection on the conditions under which ChatGPT can be safely and effectively integrated into mental health settings. Clinical Trial: The protocol for scoping review was registered on Open Science Framework https://osf.io/z6kyg
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.