JMIR Preprints #28001: A Multidimensional Data Fusion Model Based on Deep Learning for a Patient Similarity Network

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

A Multidimensional Data Fusion Model Based on Deep Learning for a Patient Similarity Network

Mohamed Adel Serhani;
Hadeel T. El Kassabi;
Hadeel T. El Kassabi;
Aberrahim Oulahaj;
Khaled Khalil

ABSTRACT

Background:

Precision medicine is a novel approach for patient care. It allows the prescription of the appropriate drug as well as suitable treatments to the right patient at the right time. It can be envisioned as the comparison of a new patient with existing patients having similar characteristics, which can be referred to as patient similarity. Several statistical, data mining, and deep learning models have been used to build and apply patient similarity network (PSN) for various purposes. However, the challenges associated with data heterogeneity and dimensionality make it difficult to use a single model that addresses both the challenges of reducing data dimensionality and capturing features of diverse data types, including contextual and longitudinal data. Furthermore, when applying multiple models, we can observe the additional challenges associated with the development of an optimum aggregation scheme that maintains high accuracy and preserves data veracity.

Objective:

In this study, we propose a multi-model PSN that considers heterogeneous data with static and dynamic characteristics for disease diagnosis for improving prediction accuracy. The static data model manages the data obtained from patient profiles, whereas the dynamic data model manages longitudinal data from patient treatment pathways and clinical data.

Methods:

We propose a combination of deep learning models and patient similarity network to obtain abundant clinical evidence and extract relevant information based on which similar patients can be explored and compared, thereby obtaining more accurate and comprehensive diagnosis and recommendations. We use the bidirectional encoder representations from transformers (BERT) to process and analyze the contextual data and generate word embedding, where semantic features are captured using a CNN. Dynamic data is analyzed using a long–short-term memory (LSTM)-based autoencoder, which reduces data dimensionality while preserving the temporal features of the data. Furthermore, we propose an aggregation-based fusion approach in which temporal data and clinical narrative data are combined for estimating the patient similarity.

Results:

We evaluated our proposed method through a series of experiments. The obtained results proved that our proposed deep learning-based PSN fusion model provides higher classification accuracy in determining various patient health outcomes when compared with other traditional classification algorithms.

Conclusions:

Our multi-model highlights the intensity of the similarity between pairs of patients, thereby realizing precise diagnosis and recommendations for a new patient.

Citation

Please cite as:

Serhani MA, T. El Kassabi H, T. El Kassabi H, Oulahaj A, Khalil K

A Multidimensional Data Fusion Model Based on Deep Learning for a Patient Similarity Network

JMIR Preprints. 16/02/2021:28001

DOI: 10.2196/preprints.28001

URL: https://preprints.jmir.org/preprint/28001

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Previously submitted to: Journal of Medical Internet Research (no longer under consideration since Jan 12, 2022)

Date Submitted: Feb 16, 2021

Open Peer Review Period: Oct 7, 2021 - Nov 26, 2021

(closed for review but you can still tweet)

NOTE: This is an unreviewed Preprint

A Multidimensional Data Fusion Model Based on Deep Learning for a Patient Similarity Network

ABSTRACT

Citation

Copyright