Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Mar 3, 2023
Date Accepted: May 25, 2023

The final, peer-reviewed published version of this preprint can be found here:

Detecting Clinically Relevant Emotional Distress and Functional Impairment in Children and Adolescents: Protocol for an Automated Speech Analysis Algorithm Development Study

Alemu, Ph.D. Y, Chen L, Duan C, Caulley, Ph.D. D, Arriaga, Ph.D. R, Sezgin, PhD E

Detecting Clinically Relevant Emotional Distress and Functional Impairment in Children and Adolescents: Protocol for an Automated Speech Analysis Algorithm Development Study

JMIR Res Protoc 2023;12:e46970

DOI: 10.2196/46970

PMID: 37351936

PMCID: 10337292

Detecting clinically relevant emotional distress and functional impairments in children and adolescents: An automated speech analysis algorithm development study

  • Yared Alemu, Ph.D.; 
  • Larry Chen; 
  • Chenghao Duan; 
  • Desmond Caulley, Ph.D.; 
  • Rosa Arriaga, Ph.D.; 
  • Emre Sezgin, PhD

ABSTRACT

Background:

Even before the onset of the COVID-19 pandemic, children and adolescents were experiencing a mental health crisis, partly due to a lack of quality mental health services. The rate of suicide for black youth has increased by 80%. By 2025, the healthcare system will be short of 225K therapists, further exacerbating the current crisis. Therefore, it's of the utmost importance for providers, schools, youth mental health, and pediatric medical providers to integrate innovation in digital mental health to identify problems proactively and rapidly for effective collaboration with other healthcare providers. Such approaches can help identify robust, reproducible, and generalizable predictors and digital biomarkers of treatment response in psychiatry. Among the multitude of digital innovations to identify a biomarker for psychiatric diseases currently, as part of the macro-level digital health transformation, speech stands out as an attractive candidate with features such as affordability, non-invasive, and non-intrusive.

Objective:

The protocol aims to develop speech-emotion recognition algorithms leveraging AI/ML which can establish a link between trauma, stress, and voice types, including disrupting speech-based characteristics, and detect clinically relevant emotional distress and functional impairments in children and adolescents.

Methods:

Informed by the theoretical foundations (the Theory of Psychological Trauma Biomarkers and Archetypal Voice Categories), we developed our methodology to focus on five emotions: Anger, Happiness, Fear, Neutral, and Sadness. Participants will be recruited from Georgia HOPE and Family Ties in Atlanta, GA. Speech samples, along with responses to the SFSS, PHQ-9, and ACE scales, will be collected using an Android mobile app. Our model development pipeline is informed by Gaussian Mixture Model, Recurrent Neural Network, and Long Short-Term Memory.

Results:

We tested our model with a public dataset. The GMM with 128 clusters showed an evenly distributed accuracy across all five emotions. Using utterance-level features, GMM achieved an accuracy of 79.15% overall, while frame selection increased accuracy to 85.35%. This demonstrates that GMM is a robust model for emotion classification of all five emotions and that emotion frame selection enhances accuracy, which is significant for scientific evaluation.

Conclusions:

This study contributes to the literature as it addresses the need for reliable and objective tools to detect clinically relevant emotional distress and functional impairments in children and adolescents. The results show that our algorithm has the potential to aid mental health providers in diagnosis and treatment planning for pediatric patients.


 Citation

Please cite as:

Alemu, Ph.D. Y, Chen L, Duan C, Caulley, Ph.D. D, Arriaga, Ph.D. R, Sezgin, PhD E

Detecting Clinically Relevant Emotional Distress and Functional Impairment in Children and Adolescents: Protocol for an Automated Speech Analysis Algorithm Development Study

JMIR Res Protoc 2023;12:e46970

DOI: 10.2196/46970

PMID: 37351936

PMCID: 10337292

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.