Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Jul 26, 2025
Date Accepted: Nov 25, 2025
Mild Cognitive Impairment Detection System Based on Unstructured Spontaneous Speech: Longitudinal Dual-modal Framework
ABSTRACT
Background:
In recent years, the incidence of cognitive diseases has also risen with the significant increase in population aging. Among these diseases, Alzheimer’s disease constitutes a substantial proportion, placing a high-cost burden on healthcare systems. To give early treatment and slow the progression of patient deterioration, it is crucial to diagnose Mild Cognitive Impairment (MCI), a transitional stage.
Objective:
In this study, we utilize autobiographical memory (AM) test speech data to establish a dual-modal longitudinal cognitive detection system for MCI. The AM test is a psychological assessment method that evaluates the cognitive status of subjects as they freely narrate important life experiences.
Methods:
Identifying hidden disease-related information in unstructured, spontaneous speech is more difficult than in structured speech. To improve this process, we use both speech and text data, which provide more clues about a person’s cognitive state. In addition, to track how cognition changes over time in spontaneous speech, we introduce an aging trajectory module. This module uses local and global alignment loss functions to better learn time-related features by aligning cognitive changes across different time points.
Results:
In our experiments on the Chinese dataset, the longitudinal model incorporating the aging trajectory module achieved AUROC of 84.81% and 88.59% on two datasets, respectively, showing significant improvement over cross-sectional, single time-point models. We also conducted ablation studies to verify the necessity of the proposed aging trajectory module. To confirm that the model not only applies to autobiographical memory test data, we used part of the model to evaluate the performance on the ADReSSo dataset, a single-time-point semi-structured data for validation, with results showing an accuracy exceeding 88.05%.
Conclusions:
This study presents a non-invasive and scalable approach for early MCI detection by leveraging autobiographical memory speech data across multiple time points. Through dual-modal analysis and the introduction of an aging trajectory module, our system effectively captures cognitive decline trends over time. Experimental results demonstrate the method’s robustness and generalizability, highlighting its potential for real-world, long-term cognitive monitoring.
Citation
The author of this paper has made a PDF available, but requires the user to login, or create an account.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.