Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Previously submitted to: Journal of Medical Internet Research (no longer under consideration since Sep 13, 2022)

Date Submitted: May 15, 2022

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Heterogeneous Semantic Graph Construction and HinSAGE Learning from Electronic Medical Records

  • Ha Na Cho; 
  • Imjin Ahn; 
  • Hansle Gwon; 
  • Heejun Kang; 
  • Yunha Kim; 
  • Heejung Choi; 
  • Minkyoung Kim; 
  • Jiye Han; 
  • Gaeun Kee; 
  • Tae Joon Jun; 
  • Young-Hak Kim

ABSTRACT

Background:

Graph representations learning is a method for introducing how to effectively construct and learn patient embeddings using electronic medical records (EMR) datasets. Adapting the integration limits will support and advance the previous methods to predict the prognosis of patients in network models.

Objective:

This study aimed to address the challenge of implementing complex and large EMR, including the following: (1) demonstrating how to build a multi-attributed and multi-relational graph model (2) and applying a downstream disease prediction task of patient’s prognosis using HinSAGE algorithm.

Methods:

A total of 61,789 patients diagnosed with angina who visited the Asan Medical Center between 2000 and 2016 were included in the study. We illustrated two bipartite graph representations from real-world EMR data sets, to effectively visualize the patients’ representation and apply the heterogeneous graph neural network using the HinSAGE algorithm. Then, we performed the link prediction task for predicting the event occurrence of myocardial infarction, stroke, heart failure and death on binary edge attributes to evaluate the network performance.

Results:

The first constructed graph model contained 492,886 nodes with attributes and 439,045 edges for displaying 10 nodes and 9 edge types. The patient-centric graph database was then used to illustrate a query of a predictive network. Moreover, the second graph model is composed of 107,682 nodes and 53,841 edges with node and edge attributes for displaying 2 node and 2 edge types. Next, the model object underwent an experiment using the HinSAGE algorithm. As a result, the performance evaluation indicated that our heterogeneous graph model outperformed other baseline methods, achieving the area under a receiver operating characteristic curve (AUROC) and the area under precision-recall curve (AUPRC) measurements of 0.69 and 0.17, respectively.

Conclusions:

The proposed implemented models successfully demonstrated the graph construction and graph representation learning on the EMR dataset to empower three roles: broaden EMR research, decision making in diagnosis, and personalized medicine. Future works may integrate additional data sources alongside the node and edge types to further improve the performance evaluation.


 Citation

Please cite as:

Cho HN, Ahn I, Gwon H, Kang H, Kim Y, Choi H, Kim M, Han J, Kee G, Jun TJ, Kim YH

Heterogeneous Semantic Graph Construction and HinSAGE Learning from Electronic Medical Records

JMIR Preprints. 15/05/2022:39522

DOI: 10.2196/preprints.39522

URL: https://preprints.jmir.org/preprint/39522

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.