Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Jul 1, 2024
Date Accepted: Nov 7, 2024
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Building a Human Digital Twin (HDTwin) Using Large Language Models for Cognitive Diagnosis: Algorithm Development and Validation

Cook D

Building a Human Digital Twin (HDTwin) Using Large Language Models for Cognitive Diagnosis: Algorithm Development and Validation

JMIR Form Res 2024;8:e63866

DOI: 10.2196/63866

PMID: 39715540

PMCID: 11704625

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

HDTwin: Building a Human Digital Twin using Large Language Models for Cognitive Diagnosis

  • Diane Cook

ABSTRACT

Background:

Human digital twins have the potential to change the practice of personalizing cognitive health diagnosis because these systems can integrate multiple sources of health information and influences into a unified model. Cognitive health is multifaceted, yet researchers and clinical professionals struggle to align diverse sources of information into a single model.

Objective:

In this paper, we introduce a method called HDTwin, for unifying heterogeneous data using large language models. HDTwin is designed to predict cognitive diagnoses and offer explanations for its inferences.

Methods:

HDTwin integrates cognitive health data from multiple sources, including demographic, behavioral, EMA, n-back test, speech, and baseline experimenter interview markers. Data are converted into text prompts for a large language model. The system then combines these inputs with relevant external knowledge from the scientific literature to construct a predictive model. The model’s performance is validated using data from three studies involving n=124 participants, comparing its diagnostic accuracy with baseline machine learning classifiers.

Results:

HDTwin achieves an overall accuracy of 0.81, significantly outperforming baseline classifiers. The experiments also reveal that HDTwin yields superior predictive accuracy when information sources are fused compared to single sources. HDTwin’s chatbot interface provides interactive dialogues, aiding in diagnosis interpretation and allowing further exploration of patient data.

Conclusions:

HDTwin integrates diverse cognitive health data, enhancing the accuracy and explainability of cognitive diagnoses. This approach outperforms traditional models and provides an interactive interface for navigating patient information. The approach shows promise for improving early detection and intervention strategies in cognitive health. Clinical Trial: digital twin, smartwatch, digital behavior markers, large language models, text analysis, machine learning


 Citation

Please cite as:

Cook D

Building a Human Digital Twin (HDTwin) Using Large Language Models for Cognitive Diagnosis: Algorithm Development and Validation

JMIR Form Res 2024;8:e63866

DOI: 10.2196/63866

PMID: 39715540

PMCID: 11704625

Per the author's request the PDF is not available.