JMIR Preprints #75960: Predicting UHR Outcomes Using Linguistic and Acoustic Measures from HiSoC Recordings: A mHealth Longitudinal Cohort Exploratory Study

Predicting UHR Outcomes Using Linguistic and Acoustic Measures from HiSoC Recordings: A mHealth Longitudinal Cohort Exploratory Study

Samuel Ming Xuan Tan;
May Yen Lieu;
Jun Kai;
Zixu Yang;
Luke K.K;
May O. Lwin;
Jimmy Lee;
Wilson Wen Bin Goh

ABSTRACT

Background:

Early detection of individuals at ultra-high risk (UHR) for psychosis is essential for timely intervention and improved clinical outcomes. However, current UHR assessments, which rely heavily on psychometric tools, often suffer from low specificity. Recent research suggests that machine learning (ML) can enhance these assessments, particularly through the integration of linguistic and acoustic features.

Objective:

In this study, we investigated the potential of audio recordings from the High-Risk Social Challenge (HiSoC) task in the development of UHR outcome prediction models.

Methods:

Audio recordings of HiSoC task responses were collected from 41 UHR participants (12 converters, 15 remitters, and 14 maintainers) enrolled in the Longitudinal Youth at Risk (LYRIK) study. Responses from the conversion group were obtained within 12 months of psychosis onset, while responses from the remission and maintenance groups were collected at baseline. Linguistic features analyzed included Words per Minute (WPM), Articulation Rate (AR), Disfluencies (DF), and Sequential Coherence (SC). Acoustic features comprised the mean and standard deviation of fundamental frequency, the mean and standard deviation of intensity, and HF500. To investigate differences in linguistic and acoustic features across outcome groups, multivariate regression analysis was performed. Additionally, a Linear Support Vector Machine (SVM) with nested cross-validation was employed to estimate the generalization error of the predictive models. Model performance was evaluated using balanced accuracy (BA) as the primary metric.
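The nested cross-validation setup described above can be sketched roughly as follows. This is a minimal illustration using scikit-learn on synthetic data, not the authors' pipeline: the fold counts, the grid of C values, the feature scaling step, the class weighting, and the binary coding of the outcome are all assumptions made for the example.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in data (NOT study data): 41 participants,
# 9 features (4 linguistic + 5 acoustic, matching the counts in the text).
rng = np.random.default_rng(0)
X = rng.normal(size=(41, 9))
# Hypothetical coding: 1 = converter (12), 0 = any other outcome (29).
y = np.array([1] * 12 + [0] * 29)

# Scaling sits inside the pipeline so it is re-fit on each training fold only.
# class_weight="balanced" is an assumption, chosen here because the groups are unequal.
model = make_pipeline(StandardScaler(), SVC(kernel="linear", class_weight="balanced"))

# Inner loop tunes C; outer loop scores the tuned model on folds
# that played no part in the tuning.
inner_cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
outer_cv = StratifiedKFold(n_splits=4, shuffle=True, random_state=1)
search = GridSearchCV(
    model,
    param_grid={"svc__C": [0.01, 0.1, 1, 10]},
    scoring="balanced_accuracy",
    cv=inner_cv,
)

# One balanced-accuracy score per outer fold.
outer_scores = cross_val_score(search, X, y, scoring="balanced_accuracy", cv=outer_cv)
print("Balanced accuracy per outer fold:", outer_scores)
print("Mean balanced accuracy:", outer_scores.mean())
```

Because hyperparameters are selected only within the inner folds, the outer-fold scores are not biased by the tuning step, which matters with a sample as small as 41 participants.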

Results:

The conversion outcome group exhibited lower WPM (adj.P = .024) and higher DF (adj.P = .004) compared with the remission outcome group. No significant differences were found in AR, SC, or acoustic measures across outcome groups. The model built on acoustic features performed best in predicting conversion (BA=0.595, 95% CI [0.287, 0.738]). The best performance in predicting remission was achieved by the model combining linguistic and acoustic features (BA=0.851, 95% CI [0.500, 0.920]).
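To help read the BA values above, balanced accuracy is the mean of sensitivity and specificity, so a classifier that ignores its input scores 0.5 regardless of group sizes. The sketch below illustrates this with the reported group counts, assuming (this is not stated in the source) that conversion is predicted against the 29 participants with other outcomes. The labels and predictions are made up for illustration.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    n_pos = sum(1 for t in y_true if t == 1)
    n_neg = len(y_true) - n_pos
    return 0.5 * (tp / n_pos + tn / n_neg)

# 12 converters (1) and 29 non-converters (0); hypothetical coding.
y_true = [1] * 12 + [0] * 29
# A trivial classifier that always predicts "non-converter".
y_pred = [0] * 41

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
ba = balanced_accuracy(y_true, y_pred)

print(f"plain accuracy:    {accuracy:.3f}")  # 29/41, inflated by the majority class
print(f"balanced accuracy: {ba:.3f}")        # 0.500, i.e. chance level
```

Under this assumption, plain accuracy would reward the trivial classifier for the class imbalance, whereas BA does not, which is one reason BA suits unequal outcome groups.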

Conclusions:

Linguistic and acoustic features extracted from HiSoC task responses can distinguish between UHR individuals with varying clinical outcomes. Future advancements in automated transcription technology could enable the complete automation of this workflow, paving the way for a scalable supplementary screening tool to complement existing psychometric assessments.

Citation

Please cite as:

Tan SMX, Lieu MY, Kai J, Yang Z, K.K L, Lwin MO, Lee J, Goh WWB

Predicting Ultra-High Risk Outcomes Using Linguistic and Acoustic Measures From High-Risk Social Challenge Recordings: mHealth Longitudinal Cohort Exploratory Study

JMIR Form Res 2025;9:e75960

DOI: 10.2196/75960

PMID: 41468564

PMCID: 12753029

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

Accepted for/Published in: JMIR Formative Research

Date Submitted: Apr 14, 2025

Date Accepted: Nov 12, 2025
