Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Previously submitted to: Journal of Medical Internet Research (no longer under consideration since Jan 12, 2022)

Date Submitted: Feb 16, 2021
Open Peer Review Period: Oct 7, 2021 - Nov 26, 2021
(closed for review but you can still tweet)

NOTE: This is an unreviewed Preprint

Warning: This is a unreviewed preprint (What is a preprint?). Readers are warned that the document has not been peer-reviewed by expert/patient reviewers or an academic editor, may contain misleading claims, and is likely to undergo changes before final publication, if accepted, or may have been rejected/withdrawn (a note "no longer under consideration" will appear above).

Peer review me: Readers with interest and expertise are encouraged to sign up as peer-reviewer, if the paper is within an open peer-review period (in this case, a "Peer Review Me" button to sign up as reviewer is displayed above). All preprints currently open for review are listed here. Outside of the formal open peer-review period we encourage you to tweet about the preprint.

Citation: Please cite this preprint only for review purposes or for grant applications and CVs (if you are the author).

Final version: If our system detects a final peer-reviewed "version of record" (VoR) published in any journal, a link to that VoR will appear below. Readers are then encourage to cite the VoR instead of this preprint.

Settings: If you are the author, you can login and change the preprint display settings, but the preprint URL/DOI is supposed to be stable and citable, so it should not be removed once posted.

Submit: To post your own preprint, simply submit to any JMIR journal, and choose the appropriate settings to expose your submitted version as preprint.

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

A Multidimensional Data Fusion Model Based on Deep Learning for a Patient Similarity Network

  • Mohamed Adel Serhani; 
  • Hadeel T. El Kassabi; 
  • Hadeel T. El Kassabi; 
  • Aberrahim Oulahaj; 
  • Khaled Khalil

ABSTRACT

Background:

Precision medicine is a novel approach for patient care. It allows the prescription of the appropriate drug as well as suitable treatments to the right patient at the right time. It can be envisioned as the comparison of a new patient with existing patients having similar characteristics, which can be referred to as patient similarity. Several statistical, data mining, and deep learning models have been used to build and apply patient similarity network (PSN) for various purposes. However, the challenges associated with data heterogeneity and dimensionality make it difficult to use a single model that addresses both the challenges of reducing data dimensionality and capturing features of diverse data types, including contextual and longitudinal data. Furthermore, when applying multiple models, we can observe the additional challenges associated with the development of an optimum aggregation scheme that maintains high accuracy and preserves data veracity.

Objective:

In this study, we propose a multi-model PSN that considers heterogeneous data with static and dynamic characteristics for disease diagnosis for improving prediction accuracy. The static data model manages the data obtained from patient profiles, whereas the dynamic data model manages longitudinal data from patient treatment pathways and clinical data.

Methods:

We propose a combination of deep learning models and patient similarity network to obtain abundant clinical evidence and extract relevant information based on which similar patients can be explored and compared, thereby obtaining more accurate and comprehensive diagnosis and recommendations. We use the bidirectional encoder representations from transformers (BERT) to process and analyze the contextual data and generate word embedding, where semantic features are captured using a CNN. Dynamic data is analyzed using a long–short-term memory (LSTM)-based autoencoder, which reduces data dimensionality while preserving the temporal features of the data. Furthermore, we propose an aggregation-based fusion approach in which temporal data and clinical narrative data are combined for estimating the patient similarity.

Results:

We evaluated our proposed method through a series of experiments. The obtained results proved that our proposed deep learning-based PSN fusion model provides higher classification accuracy in determining various patient health outcomes when compared with other traditional classification algorithms.

Conclusions:

Our multi-model highlights the intensity of the similarity between pairs of patients, thereby realizing precise diagnosis and recommendations for a new patient.


 Citation

Please cite as:

Serhani MA, T. El Kassabi H, T. El Kassabi H, Oulahaj A, Khalil K

A Multidimensional Data Fusion Model Based on Deep Learning for a Patient Similarity Network

JMIR Preprints. 16/02/2021:28001

DOI: 10.2196/preprints.28001

URL: https://preprints.jmir.org/preprint/28001

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.