Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Oct 7, 2025
Open Peer Review Period: Oct 8, 2025 - Dec 3, 2025
Date Accepted: Jan 9, 2026
(closed for review but you can still tweet)
Multimodal AI for Alzheimer’s Disease Diagnosis: A Systematic Review of Datasets, Models and Modalities
ABSTRACT
Background:
Alzheimer’s disease (AD) is a major global health challenge, characterised by progressive cognitive decline and rising prevalence due to population ageing. Despite increased awareness, a substantial diagnostic gap remains, underscoring the urgent need for scalable and accurate early detection methods. Artificial intelligence (AI) provides a promising solution by integrating multimodal data to enhance diagnostic accuracy and clinical decision-making. However, current research remains fragmented, with studies often focusing separately on clinical phenotyping or linguistic-based cognitive impairment data.
Objective:
This review aims to systematically synthesise recent advances in multimodal AI approaches for AD diagnosis and risk prediction, bridging the gap between clinical phenotyping and linguistic-based cognitive assessment research.
Methods:
Following the PRISMA 2020 guidelines, a comprehensive search was conducted across major databases to identify studies published in the past five years that applied multimodal AI models for AD diagnosis or cognitive impairment detection. A total of 97 eligible studies were included for qualitative synthesis. Extracted data covered modality combinations, AI modelling strategies, evaluation metrics, and dataset usage.
Results:
Multimodal AI models consistently outperformed single-modality approaches in diagnostic accuracy and generalisability. Deep learning and ensemble learning frameworks were the most frequently adopted methods, offering strong feature-fusion capabilities. However, methodological heterogeneity, data imbalance, and limited dataset accessibility were recurring challenges. The review further summarises representative modality pairings such as imaging + clinical, imaging + genetic, and speech + cognitive features and provides an overview of publicly available multimodal datasets to facilitate reproducibility and benchmarking.
Conclusions:
This systematic review provides a comprehensive overview of multimodal AI models for AD diagnosis, uniquely integrating both clinical phenotyping and linguistic-based cognitive assessment perspectives. It highlights key methodological trends, dataset limitations, and opportunities for future research toward standardised, interpretable, and generalisable AI-driven diagnostic frameworks for Alzheimer’s disease.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.