Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Feb 27, 2021
Date Accepted: May 5, 2021

The final, peer-reviewed published version of this preprint can be found here:

Document Retrieval for Precision Medicine Using a Deep Learning Ensemble Method

Liu Z, Feng J, Yang Z, Wang L

Document Retrieval for Precision Medicine Using a Deep Learning Ensemble Method

JMIR Med Inform 2021;9(6):e28272

DOI: 10.2196/28272

PMID: 34185006

PMCID: 8278302

An ensemble information retrieval method for the biomedical domain

  • Zhiqiang Liu; 
  • Jingkun Feng; 
  • Zhihao Yang; 
  • Lei Wang

ABSTRACT

Background:

With the development of biomedicine, the number of biomedical documents has increased rapidly, which brings a great challenge for researchers retrieving the information they need. Information retrieval aims to meet this challenge by searching relevant documents from abundant documents based on the given query. However, sometimes the relevance of search results needs to be evaluated from multiple aspects in some specific retrieval tasks and thereby increases the difficulty of biomedical information retrieval.

Objective:

This study aims to find a more systematic method to retrieve relevant scientific literature for a given patient.

Methods:

In the initial retrieval stage, we supplement query terms through query expansion strategies and apply query boosting to obtain an initial ranking list of relevant documents. In the re-ranking phase, we employ a text classification model and relevance matching model to evaluate documents respectively from different dimensions, then we combine the outputs through logistic regression to re-rank all the documents from the initial ranking list.

Results:

The proposed ensemble method contributes to the improvement of biomedical retrieval performance. Comparing with the existing deep learning-based methods, experimental results show that our method achieves state-of-the-art performance on the data collection provided by TREC 2019 Precision Medicine Track.

Conclusions:

In this paper, we propose a novel ensemble method based on deep learning. As shown in the experiments, the strategies we used in the initial retrieval phase such as query expansion and query boosting are effective. The application of the text classification model and the relevance matching model can better capture semantic context information and improve retrieval performance.


 Citation

Please cite as:

Liu Z, Feng J, Yang Z, Wang L

Document Retrieval for Precision Medicine Using a Deep Learning Ensemble Method

JMIR Med Inform 2021;9(6):e28272

DOI: 10.2196/28272

PMID: 34185006

PMCID: 8278302

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.