Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Sep 5, 2023
Date Accepted: Jan 20, 2024

The final, peer-reviewed published version of this preprint can be found here:

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach

Nguyen DA, Li M, Lambert G, Kowalczyk R, McDonald R, Vo BQ

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach

JMIR Form Res 2024;8:e52482

DOI: 10.2196/52482

PMID: 38526545

PMCID: 11002730

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

An efficient approach to machine reading comprehension for healthcare applications based on context extraction

  • Duy-Anh Nguyen; 
  • Minyi Li; 
  • Gavin Lambert; 
  • Ryszard Kowalczyk; 
  • Rachael McDonald; 
  • Bao Quoc Vo

ABSTRACT

Background:

Machine reading comprehension (MRC) tasks have been developed, with state-of-the-art extractive methods having achieved comparable or better accuracy than human performance on benchmark datasets. However, such models are not as successful when adapted to complex domains such as healthcare. One of the main reasons is that the context that the MRC model needs to process when operating in a complex domain can be much larger compared to an average open domain context. A potential solution to this problem is reducing the input context to the MRC model by extracting only the necessary parts from the original context.

Objective:

This study aims to develop a method for extracting useful contexts from long articles for Question Answering to enable the MRC model to work more efficiently and accurately.

Methods:

Existing approaches to context extraction in MRC were based on sentence selection strategies, in which the models were trained to find the sentence(s) containing the answer. We found that using just the sentences containing the answer was insufficient for the MRC model to predict correctly. We conducted a series of empirical studies and observed a strong relation between the usefulness of the context and the confidence score output by the MRC model. We also found a strong relationship between the model’s confidence in its predictions and the prediction accuracy. We propose a method to estimate the utility of each sentence in a context in answering the question, then extract a new, shorter context according to these estimations. We generated a dataset to train two models for estimating sentence utility, based on which we select more precise contexts that improve the MRC model’s performance.

Results:

We demonstrated our approach on the COVID-QA and BioASQ datasets and showed that our proposed context extraction approach benefits the downstream MRC model. First, our proposed method significantly reduces the inference time of the entire QA system by 6 - 7 times. Second, our approach helps the MRC model predict the answer more accurately compared to using the original context (2.76% increase in F1 score for COVID-QA and 8.14% increase in F1 score for BioASQ). We also found a potential problem in extractive Transformers MRC models where they predict poorly when given a shorter, more precise context in some cases.

Conclusions:

Utilization of the proposed context extraction method allows the MRC model to achieve improved prediction accuracy and significantly reduces MRC inference time.


 Citation

Please cite as:

Nguyen DA, Li M, Lambert G, Kowalczyk R, McDonald R, Vo BQ

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach

JMIR Form Res 2024;8:e52482

DOI: 10.2196/52482

PMID: 38526545

PMCID: 11002730

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.