JMIR Preprints #17653: Building a Medical Knowledge Graph to Enhance the Fraud, Waste and Abuse Detection on Claim Data

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Building a Medical Knowledge Graph to Enhance the Fraud, Waste and Abuse Detection on Claim Data

Jin Xiao;
Haixia Sun;
Yilong He;
Lingyu Cao;
Sheng Zhang;
Wei Zhu;
Xiaowei Xu;
Li Hou;
Jiao Li;
Yuan Ni;
Guotong Xie

ABSTRACT

Background:

Detection of Fraud, Waste and Abuse (FWA) is an important yet challenging problem for the health insurance industry. A key step in the FWA detection is to check the clinical reasonableness of the claim, e.g. Whether the drug has an indication for the diagnosis. Currently, it requires human experts with enough medical knowledge to do this. To reduce the cost, there is a trend to build an intelligent system to automatically detect suspicious claims with impropriate diagnosis/medication. To implement such a system, the core foundation is a comprehensive and reliable medical knowledge graph (MKG).

Objective:

This study aimed to build a knowledge graph helping fraud, waste and abuse detection on claim data.

Methods:

In this paper, we defined the schema of a medical knowledge graph for the claim data processing. We applied Chinese natural language processing (NLP) methods to automatically build the medical knowledge graph from the unstructured knowledge sources such as insurance agreement documents, textbooks, drug labels, etc. In the instant identification, we integrated machine learning method with knowledge-driven method to identify symptoms, diagnosis, treatment, prognosis, examination and drug. In the relation identification, we used the distant supervision method to conduct the extraction relations including indication, contraindication, side effects, interaction, taboo, check, and sign. Then we developed a tool to enable the humanmachine collaboration for knowledge graph fusion and knowledge graph quality control. Finally, we apply the MKG in the claim processing to assist the FWA detection.

Results:

We collected 185,796 drug labels from China Food and Drug Administration (CFDA), 88,892 kinds of disease information from medical textbooks including symptoms, diagnosis, treatment, prognosis, and 5,272 examinations and examination information as the knowledge sources. The final medical knowledge graph includes 1,616,549 nodes and 5,963,444 edges. We conducted extensive experiments to validate the accuracy of the named entity recognition (NER) and relation extraction (RE). Our NER method achieved F1 score of 0.83 and our RE obtained F1 score of 0.90. In addition, we randomly selected 100 claim documents and applied the MKG to detect the suspected claims. The MKG was assisted to detect 70% of the suspected claims.

Conclusions:

In this paper, we automatically constructed a medical knowledge graph (MKG) for FWA detection. The MKG based method successfully identified FWA suspected (such as fraud diagnosis, excess prescription and irrational prescription) from the claim documents, which helped to improve the efficiency of the claim processing.

Citation

Please cite as:

Xiao J, Sun H, He Y, Cao L, Zhang S, Zhu W, Xu X, Hou L, Li J, Ni Y, Xie G

Medical Knowledge Graph to Enhance Fraud, Waste, and Abuse Detection on Claim Data: Model Development and Performance Evaluation

JMIR Med Inform 2020;8(7):e17653

DOI: 10.2196/17653

PMID: 32706714

PMCID: 7413281

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Dec 31, 2019

Date Accepted: May 28, 2020

Building a Medical Knowledge Graph to Enhance the Fraud, Waste and Abuse Detection on Claim Data

ABSTRACT

Citation

Copyright

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Dec 31, 2019

Date Accepted: May 28, 2020

Building a Medical Knowledge Graph to Enhance the Fraud, Waste and Abuse Detection on Claim Data

ABSTRACT

Citation

The author of this paper has made a PDF available, but requires the user to login, or create an account.

Copyright