Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Dec 31, 2019
Date Accepted: Mar 19, 2020

The final, peer-reviewed published version of this preprint can be found here:

A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development

Li L, Wang P, Wang Y, Wang S, Yan J, Jiang J, Tang B, Wang C, Liu Y

A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development

JMIR Med Inform 2020;8(5):e17645

DOI: 10.2196/17645

PMID: 32436854

PMCID: 7273238

PrTransX: An effective method to learn embedding of probabilistic medical knowledge graph

  • Linfeng Li; 
  • Peng Wang; 
  • Yao Wang; 
  • Shenghui Wang; 
  • Jun Yan; 
  • Jinpeng Jiang; 
  • Buzhou Tang; 
  • Chengliang Wang; 
  • Yuting Liu

ABSTRACT

Background:

Knowledge graph embedding is an effective semantic representation method for entities and relations in knowledge graphs. Several translation-based algorithms, including TransE, TransH, TransR, TransD and TranSparse, have been proposed to learn effective embedding vectors from typical knowledge graphs in which the relations between head and tail entities are deterministic.

Objective:

However, in medical knowledge graphs, the relations between head and tail entities are inherently probabilistic. Such a difference introduced challenge in embedding medical knowledge graphs: how to learn the probability values of triplets into representation vectors.

Methods:

To address the challenge, enhancements are made to existing TransX (X=E/H/R/D/Sparse) algorithms, including the following: 1) constructing a mapping function between the score value and the probability and 2) introducing probability-based loss of triplets into the original margin-based loss function.

Results:

The proposed PrTransX algorithm is performed on a medical knowledge graph that was built from large-scale real-world electronic medical records (EMR) data. The embeddings are evaluated using link prediction task. Comparing with the corresponding TransX algorithm, the proposed PrTransX performs better in all evaluation indicators, achieving a higher Hit@10 and NDCG@10 and lower Mean Rank.

Conclusions:

We can conclude that the proposed PrTransX successfully incorporated the uncertainty of the knowledge triplets into the embedding vectors.


 Citation

Please cite as:

Li L, Wang P, Wang Y, Wang S, Yan J, Jiang J, Tang B, Wang C, Liu Y

A Method to Learn Embedding of a Probabilistic Medical Knowledge Graph: Algorithm Development

JMIR Med Inform 2020;8(5):e17645

DOI: 10.2196/17645

PMID: 32436854

PMCID: 7273238

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.