Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Previously submitted to: JMIR Mental Health (no longer under consideration since Jan 16, 2026)

Date Submitted: Dec 7, 2025
Open Peer Review Period: Dec 12, 2025 - Jan 16, 2026
(closed for review but you can still tweet)

NOTE: This is an unreviewed Preprint

Warning: This is a unreviewed preprint (What is a preprint?). Readers are warned that the document has not been peer-reviewed by expert/patient reviewers or an academic editor, may contain misleading claims, and is likely to undergo changes before final publication, if accepted, or may have been rejected/withdrawn (a note "no longer under consideration" will appear above).

Peer review me: Readers with interest and expertise are encouraged to sign up as peer-reviewer, if the paper is within an open peer-review period (in this case, a "Peer Review Me" button to sign up as reviewer is displayed above). All preprints currently open for review are listed here. Outside of the formal open peer-review period we encourage you to tweet about the preprint.

Citation: Please cite this preprint only for review purposes or for grant applications and CVs (if you are the author).

Final version: If our system detects a final peer-reviewed "version of record" (VoR) published in any journal, a link to that VoR will appear below. Readers are then encourage to cite the VoR instead of this preprint.

Settings: If you are the author, you can login and change the preprint display settings, but the preprint URL/DOI is supposed to be stable and citable, so it should not be removed once posted.

Submit: To post your own preprint, simply submit to any JMIR journal, and choose the appropriate settings to expose your submitted version as preprint.

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

A Hybrid AI-Human Mental Health System with a Clinician-Scribe-in-the-Loop Layer: A Privacy-Preserving, Culturally-Informed Framework for Multilingual Populations in India

  • Sathya moorthy Buma Sridhar

ABSTRACT

Background:

The abstract is structured using the standard format for a systematic review and framework proposal (Background, Objective, Methods, Results, Conclusion).

Background:

Mental health disorders are rising globally, but access to qualified professionals is limited, particularly in low- and middle-income countries like India. While hybrid AI-human systems have demonstrated clear advantages in terms of safety and trust over standalone AI , existing reviews rarely examine their applicability for linguistically and culturally diverse populations. A critical evidence gap exists regarding robust safety protocols, multilingual support, and privacy-preserving architectures.

Objective:

The objective of this systematic review was to evaluate hybrid AI-human mental health systems developed between 2020 and 2025, with a focus on safety, clinical supervision, multilingual adaptation, and privacy mechanisms. Additionally, this study aims to propose a novel, integrated hybrid architecture suitable for large, linguistically diverse populations such as India.

Methods:

Following the PRISMA 2020 guidelines , searches were conducted across five major databases (PubMed, Scopus, IEEE Xplore, ACM DL, Google Scholar) covering the period January 2020 to March 2025. A total of 2,847 records were screened, resulting in the final inclusion of 56 empirical studies. Study quality was assessed using RoB-2, NOS, and MMAT.

Results:

Only 11 studies (19%) implemented structured clinician supervision, which consistently demonstrated improved trust (35%-50%) and reduced crisis events (40%-55%) compared to AI-only systems. However, multilingual support appeared in only 22% of studies, with true cultural adaptation in a mere 4%. Furthermore, privacy-preserving mechanisms (e.g., federated learning) were implemented in only 15% of systems. Research originating from India represented only 5% of the included studies, underscoring a major evidence gap. Conclusion: Hybrid AI-human models offer significant advantages in safety, trust, and engagement, but global adoption is limited by critical shortcomings in multilingual capability, cultural adaptation, and privacy-by-design architecture. To address these needs, we propose a new, integrated hybrid framework that introduces a Clinician-Scribe-in-the-Loop layer. This architecture embeds human expertise for culturally-informed data enrichment and oversight at the input stage, enabling safer, scalable, and more equitable digital mental health support for regions characterized by linguistic diversity and high treatment gaps, such as India.

Objective:

The objective of this study is to systematically evaluate hybrid Artificial Intelligence (AI)-human mental health systems developed between January 2020 and March 2025, with a particular focus on their safety, clinical supervision models, multilingual adaptation, and privacy-preserving mechanisms. Additionally, this review aims to identify significant gaps in the existing digital mental health literature regarding their applicability to linguistically diverse and resource-constrained populations like those in India. Finally, this study proposes a comprehensive, culturally aligned hybrid architecture, incorporating a Clinician-Scribe-in-the-Loop layer, to guide future development and enable safer, scalable, and equitable mental health support.

Methods:

The systematic review followed the PRISMA 2020 guidelines for evidence synthesis. The review protocol was defined a priori and adhered to best practices in digital health evidence synthesis. 1. Eligibility Criteria Studies were included if they met the following criteria: Population: Users seeking mental health support or psychological wellbeing interventions. Intervention: Systems involving AI-assisted, LLM-based, or algorithmic mental health support with explicit human involvement (clinician, counselor, moderator, supervisor). Outcomes: Engagement, safety, clinical effects, privacy mechanisms, multilingual usability, or system architecture. Study Type: Randomized trials, observational studies, feasibility studies, and development/technical evaluations. Timeframe: January 1, 2020, to March 30, 2025. Language: English. Publication: Peer-reviewed journal or conference proceedings. Exclusion Criteria: Studies were excluded if they focused only on standalone AI without human involvement, lacked empirical data, or were reviews, commentaries, or opinion pieces. 2. Data Sources and Search Strategy Searches were conducted across five major databases: PubMed Scopus IEEE Xplore ACM Digital Library (ACM DL) Google Scholar The search strategy combined Boolean terms related to the core concepts: Condition: "mental health", "depression", "anxiety", "wellbeing" Technology: "AI", "chatbot", "LLM", "conversational agent" Hybrid Model: "hybrid", "clinical supervision", "human-in-the-loop", "clinician-in-the-loop", "human supervised", "clinical oversight" Gaps: "privacy", "multilingual", "cultural adaptation" 3. Study Selection and Data Extraction A total of 2,847 records were initially identified. After duplicate removal, two reviewers independently screened titles and abstracts. Full texts of potentially eligible studies were assessed using predefined criteria. Disagreements were resolved through discussion or senior reviewer arbitration. The final inclusion count was 56 studies. A PRISMA flow diagram summarized the selection process. Two reviewers independently extracted data using a structured form to minimize bias. The captured data included: Study characteristics (country, year, design, sample size) AI architecture and model type Nature of human involvement (clinician, moderator, peer supporter) Multilingual and cultural adaptation features Privacy-preserving mechanisms LLM-specific safety controls Engagement metrics and clinical outcomes 4. Quality Assessment and Synthesis Quality Assessment: Quality appraisal was performed for each study and rated as low, moderate, or high risk of bias. Tools used included: RoB-2 (Risk of Bias 2) for randomized controlled trials (RCTs) Newcastle-Ottawa Scale (NOS) for observational studies Mixed Methods Appraisal Tool (MMAT) for development/feasibility studies Synthesis Approach: Due to heterogeneity in study designs, interventions, and outcome measures, a narrative synthesis approach was used. Findings were grouped and analyzed under six key themes: Hybrid AI-human system architecture Clinical supervision and safety Multilingual capability and cultural adaptation Privacy-preserving mechanisms LLM safety controls Engagement and clinical effects Quantitative trends were reported where possible (e.g., trust improvement, crisis reduction).

Results:

The results of the systematic review on hybrid AI-human mental health systems (2020-2025) are summarized below, organized by the key themes analyzed. Study Selection and CharacteristicsTotal Records: 2,847 records were screened; 56 studies met the final inclusion criteria.Study Types: The included studies comprised: 18 randomized controlled trials (RCTs), 23 observational studies, and 15 development or feasibility studies.Sample Size: The median sample size was 243 participants.Geographic Origin: The majority of studies originated from high-income regions:United States (39%)Europe (27%)Asia (21%)India (5%) Hybrid AI-Human Mental Health ModelsThe evaluation of hybrid models, where human clinicians or supervisors are explicitly involved, demonstrated consistent advantages over standalone AI systems.Clinician Supervision Adoption: Only 11 studies (19%) implemented structured clinician or human-supervisor involvement.Performance Metrics for Hybrid Systems:Trust Improvement: Trust scores increased by 35% to 50% compared with AI-only systems.Crisis Events: Crisis events decreased by 40% to 55% in systems with human escalation protocols.User Satisfaction: User satisfaction was higher (mean $4.3/5$) compared to AI-only systems ($3.5/5$).Qualitative Benefits: Hybrid workflows consistently supported context correction, ethical alignment, and safer crisis management. Multilingual and Cultural AdaptationThis area revealed the most significant evidence gap, particularly for highly diverse populations.Multilingual Support: Only 12 studies (22%) provided multilingual support, primarily focusing on English-Spanish or English-Mandarin.True Cultural Adaptation: Only 2 studies (4%) conducted true cultural adaptation, which included local idioms, emotion constructs, and culturally sensitive phrasing.India-Specific Research: Indian languages were addressed in only 3 studies, none of which implemented LLM-based cultural tuning. Privacy and Data Protection MechanismsPrivacy engineering was identified as a major deficit across the literature.Adoption Rate: Advanced privacy-preserving methods were reported in only 15% of studies.Specific Mechanisms Implemented:Federated learning (n=4)Differential privacy (n=3)On-device inference/processing (n=2)Lacking Mechanisms: A large majority (85%) of studies relied solely on basic encryption or platform-level security, lacking meaningful privacy engineering. Studies using federated learning showed better user retention (+12-18%), suggesting a link to higher perceived safety. Voice Journaling and LLM-Based SystemsVoice Journaling: Seven studies integrated voice journaling or voice biomarkers.Engagement: Engagement improved to 78% in voice-based systems, compared with 53% in text-only systems.Voice journaling was especially effective for low-literacy and older users.LLM-Driven Interventions: LLM-based tools appeared in 17 studies (30%), mostly after 2023.Safety Strategies Implemented: Content moderation (n=12), crisis detection classifiers (n=10), human validation or supervisory review (n=6), and alignment with psychological guidelines (n=7) were the primary safety mechanisms.Unpredictable outputs and safety risks were frequently reported, highlighting the need for hybrid oversight.

Conclusions:

Hybrid AI-human mental health systems demonstrate clear advantages over standalone AI tools, particularly in terms of trust, safety, engagement, and crisis management. The systems with structured clinical supervision showed significantly improved outcomes, including a 35% to 50% increase in trust and a 40% to 55% reduction in crisis events. Although Large Language Model (LLM)-driven systems have expanded rapidly since 2023, their effectiveness still depends heavily on structured human oversight and robust safety mechanisms. Key Gaps and Insufficiency of Current Systems Despite these global advancements, widespread and equitable adoption remains limited due to critical, recurring gaps identified in the review: Multilingual and Cultural Alignment: Culturally adapted and multilingual systems are rare, with true cultural adaptation found in only 4% of studies. Privacy Engineering: Advanced privacy-preserving architectures were uncommon, with 85% of systems relying solely on basic encryption despite high user concern. Geographic Imbalance: Research focused on countries with high linguistic diversity and large treatment gaps, such as India, represents only 5% of the included studies. For countries like India—characterized by linguistic diversity, high digital penetration, and a large treatment gap—current global systems are demonstrably insufficient. Proposed Framework and Future Direction This review highlights the urgent need for integrated, privacy-first, clinician-supervised, and multilingual digital mental health frameworks tailored to local contexts. To address these gaps, we propose a comprehensive hybrid architecture that combines AI-driven personalization with human expertise, specifically through a Clinician-Scribe-in-the-Loop layer. This novel framework aims to enable safer, scalable, and culturally aligned mental health support. Future research must prioritize real-world clinical validation of such hybrid models, culturally grounded LLM alignment, and rigorous privacy engineering. By advancing hybrid models that effectively balance technological innovation with clinical responsibility, digital mental health systems can more effectively and equitably meet the needs of diverse global populations. Clinical Trial: Protocol and Registration The systematic review followed the PRISMA 2020 guidelines for reporting evidence synthesis. The review protocol was defined a priori and adhered to best practices in digital health evidence synthesis. Although the review protocol was rigorously documented, the systematic review was not prospectively registered in an external database (e.g., PROSPERO). This is a limitation acknowledged in the Discussion section. The primary contribution of this work is the synthesis of evidence leading to the proposal of a novel integrated hybrid architecture to address critical gaps in cultural, multilingual, and privacy mechanisms for diverse populations. This statement is factual, meets the reporting requirement, and immediately frames the systematic review as completed and the framework proposal as the novel output, which is the core argument of your submission.


 Citation

Please cite as:

Buma Sridhar Sm

A Hybrid AI-Human Mental Health System with a Clinician-Scribe-in-the-Loop Layer: A Privacy-Preserving, Culturally-Informed Framework for Multilingual Populations in India

JMIR Preprints. 07/12/2025:89120

DOI: 10.2196/preprints.89120

URL: https://preprints.jmir.org/preprint/89120

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.