JMIR Preprints #90491: Beyond Acoustic Features: Incorporating Linguistic Variables in Automatic Speech Analysis for Depression Detection

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Beyond Acoustic Features: Incorporating Linguistic Variables in Automatic Speech Analysis for Depression Detection

Patricia Laura Maran;
Peru Gabirondo;
Alexandra Vlaic;
Maria Teresa Alonzo-Castillo;
Tomas Rojo;
Carla Zaldua;
Julia Vendrell-Serres;
Josep Antoni Ramos-Quiroga;
María Dolores Braquehais;
Amanda Rodríguez-Urrutia

ABSTRACT

Background:

While most research on automatic speech analysis (ASA) has focused on acoustic features, the potential of linguistic markers remains underexplored, particularly in clinically diagnosed, non-English-speaking populations.

Objective:

This study evaluated the integration of acoustic and linguistic markers for detecting depression in a Spanish-speaking clinical sample.

Methods:

The sample included 80 patients with major depressive disorder (MDD) or persistent depressive disorder (PDD) from the Psychiatry Department of Vall d'Hebron University Hospital and 71 healthy controls. Participants answered 11 open-ended questions related to depressive symptoms and well-being via a web-based platform. Linguistic and acoustic variables spanning four categories, namely, prosodic, cepstral, spectral, and Teager Energy Operator (TEO)-based features, were extracted. Group comparisons and logistic regressions were performed to assess the predictive value of acoustic and linguistic features. Machine learning models were used to compare the performance of acoustic, linguistic, and ensemble classification models combining both feature sets.

Results:

TEO-based and cepstral features showed the strongest predictive power. Among linguistic features, greater use of verbs, reduced use of nouns and past-tense verbs, smaller vocabulary size, and increased use of shorter words and sentences remained strong predictors of depression after adjusting for covariates. The linguistic model outperformed the acoustic model (AUC = 0.86 vs. = 0.79), while the ensemble model combining both acoustic and linguistic features achieved comparable overall performance (AUC = 0.86), with slightly improved accuracy (0.84) and specificity (0.93).

Conclusions:

Linguistic features differentiated patients with depression from healthy controls. With further validation and refinement, brief speech-based assessments could aid early depression detection in primary care and community settings.

Citation

Please cite as:

Maran PL, Gabirondo P, Vlaic A, Alonzo-Castillo MT, Rojo T, Zaldua C, Vendrell-Serres J, Ramos-Quiroga JA, Braquehais MD, Rodríguez-Urrutia A

Beyond Acoustic Features: Incorporating Linguistic Variables in Automatic Speech Analysis for Depression Detection

JMIR Preprints. 29/12/2025:90491

DOI: 10.2196/preprints.90491

URL: https://preprints.jmir.org/preprint/90491

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Previously submitted to: JMIR Mental Health (no longer under consideration since Dec 31, 2025)

Date Submitted: Dec 29, 2025

Beyond Acoustic Features: Incorporating Linguistic Variables in Automatic Speech Analysis for Depression Detection

ABSTRACT

Citation

Copyright