JMIR Preprints #38095: TESLEA: Medical Text Simplification using Reinforcement Learning

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

TESLEA: Medical Text Simplification using Reinforcement Learning

Atharva Phatak;
David W. Savage;
Robert Ohle;
Jonathan D. Smith;
Vijay Mago

ABSTRACT

Background:

In most cases, the abstracts of articles in the medical domain are publicly available. Although these are accessible by everyone, they are hard to comprehend for a wider audience due to the complex medical vocabulary. Thus, simplifying these complex abstracts is essential to make medical research accessible to the general public.

Objective:

This paper aims to develop a deep learning model that converts complex medical text to a simpler version while maintaining the quality of the generated text.

Methods:

A text simplification approach using Reinforcement Learning and Transformer-based language models was developed. Relevance reward, Flesch Kincaid reward and Lexical Simplicity reward were optimized to help simplify jargon-dense complex medical paragraphs to their simpler versions while retaining the quality of the text. The model was trained using 3,568 complex-simple medical paragraphs and evaluated on 480 paragraphs via the help of automated metrics and human annotation.

Results:

The proposed method outperformed previous baselines on Flesch Kincaid Scores (11.84) and achieved comparable performance to other baselines when measured using ROUGE-1 (0.39), ROUGE-2 (0.11) and SARI scores (0.40). Manual evaluation showed that percent agreement between human annotators was more than 70% when factors like fluency, coherence and adequacy were considered.

Conclusions:

A unique medical text simplification approach is successfully developed that leverages reinforcement learning and accurately simplifies complex medical paragraphs, hence increasing their readability.

Citation

Please cite as:

Phatak A, Savage DW, Ohle R, Smith JD, Mago V

Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach

JMIR Med Inform 2022;10(11):e38095

DOI: 10.2196/38095

PMID: 36399375

PMCID: 9719064

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 18, 2022

Date Accepted: Oct 12, 2022

TESLEA: Medical Text Simplification using Reinforcement Learning

ABSTRACT

Citation

Copyright