JMIR Preprints #92817: AI Chatbot Suicide Risk Detection and Response: Human Validation of the Open-Source VERA-MH Safety Evaluation

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

AI Chatbot Suicide Risk Detection and Response: Human Validation of the Open-Source VERA-MH Safety Evaluation

Kate H Bentley;
Luca Belli;
Adam M. Chekroud;
Emily J. Ward;
Emily R. Dworkin;
Emily Van Ark;
Kelly M. Johnston;
Will Alexander;
Millard Brown;
Matt Hawrilenko

ABSTRACT

Background:

Millions of people now use leading generative AI tools (chatbots) for psychological support. Despite the promise related to availability and scale, the single most pressing question in AI for mental health is whether these tools are safe. The field currently lacks a clinically validated, evidence-based benchmark for determining AI safety for mental health. The Validation of Ethical and Responsible AI in Mental Health (VERA-MH) evaluation was recently proposed to meet the urgent need for a rigorous, automated safety benchmark.

Objective:

This study aims to examine the clinical validity and reliability of the recently developed VERA-MH evaluation for AI safety in suicide risk detection and response.

Methods:

We simulated a large set of conversations between large language model (LLM)-based users (“user-agents”) spanning a wide range of suicide risk levels and disclosure styles and general-purpose AI chatbots. Licensed mental health clinicians used a rubric (scoring guide) developed for VERA-MH to independently rate the simulated conversations for safe and unsafe chatbot behaviors, as well as user-agent realism. An LLM-based evaluator (the VERA-MH “judge”) used the same scoring rubric to evaluate the same set of simulated conversations. We then compared rating alignment across (a) individual clinicians and (b) clinician consensus and the LLM judge, and (c) examined clinicians’ ratings of user-agent realism.

Results:

Individual clinicians were generally consistent with one another in their safety ratings (chance-corrected inter-rater reliability [IRR]: 0.77), thus establishing a gold-standard clinical reference. The LLM judge was strongly aligned with this clinical consensus (IRR: 0.81) overall and within key conditions. Clinician raters generally perceived the user-agents to be realistic.

Conclusions:

For the potential mental health benefits of AI chatbots to be realized, attention to safety is paramount. Findings from this human evaluation study support the clinical validity and reliability of VERA-MH: an open-source, fully automated AI safety evaluation for mental health. Further research will address VERA-MH generalizability and robustness, and future expansions can target other key areas of AI safety for mental health.

Citation

Please cite as:

Bentley KH, Belli L, Chekroud AM, Ward EJ, Dworkin ER, Van Ark E, Johnston KM, Alexander W, Brown M, Hawrilenko M

AI Chatbot Suicide Risk Detection and Response: Human Validation Study of the Open-Source VERA-MH Safety Evaluation

JMIR AI 2026;5:e92817

DOI: 10.2196/92817

PMID: 42257560

PMCID: 13365878

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR AI

Date Submitted: Feb 3, 2026

Date Accepted: Jun 2, 2026

Date Submitted to PubMed: Jun 8, 2026

AI Chatbot Suicide Risk Detection and Response: Human Validation of the Open-Source VERA-MH Safety Evaluation

ABSTRACT

Citation

Copyright