Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Feb 28, 2020
Date Accepted: Aug 2, 2021

The final, peer-reviewed published version of this preprint can be found here:

Defining Patient-Oriented Natural Language Processing: A New Paradigm for Research and Development to Facilitate Adoption and Use by Medical Experts

Sarker A, Al-Garadi MA, Yang YC, Choi J, Quyyumi AA, Martin GS

Defining Patient-Oriented Natural Language Processing: A New Paradigm for Research and Development to Facilitate Adoption and Use by Medical Experts

JMIR Med Inform 2021;9(9):e18471

DOI: 10.2196/18471

PMID: 34581670

PMCID: 8512184

Patient-oriented natural language processing: Defining a new paradigm for research and development to facilitate adoption and utilization by medical experts

  • Abeed Sarker; 
  • Mohammed Ali Al-Garadi; 
  • Yuan-Chi Yang; 
  • Jinho Choi; 
  • Arshed A Quyyumi; 
  • Greg S Martin

ABSTRACT

The capabilities of natural language processing (NLP) methods have expanded significantly in recent years, particularly driven by advances in data science and machine learning. However, the utilization of NLP for patient-oriented clinical research and care (POCRC) is still limited. A primary reason behind this is perhaps the fact that clinical NLP methods are developed, optimized, and evaluated on narrow-focus datasets and tasks (e.g., for the detection of specific symptoms from free texts). Such research and development (R&D) approaches may be described as problem-oriented, and the developed systems only perform well for a given specialized task. As standalone systems, they are also typically not suitable for addressing the needs of POCRC, leaving a gap between the capabilities of clinical NLP methods and the needs of patient-facing medical experts. We believe that to make clinical NLP systems more valuable, future R&D efforts need to follow a new research paradigm, one that explicitly incorporates characteristics that are crucial for POCRC. We present our viewpoint about four interrelated characteristics, three representing NLP system properties and one associated with the R&D process—(i) generalizability (capability to characterize patients, not clinical problems), (ii) interpretability (ability to explain system decisions), (iii) customizability (flexibility for adaptation to distinct settings, problems and cohorts), and (iv) cross-evaluation (validated performance on heterogeneous datasets)—that are relevant for NLP systems suitable for POCRC. Using the NLP task of clinical concept detection as an example, we detail these characteristics and discuss how they may lead to increased uptake of NLP systems for POCRC.


 Citation

Please cite as:

Sarker A, Al-Garadi MA, Yang YC, Choi J, Quyyumi AA, Martin GS

Defining Patient-Oriented Natural Language Processing: A New Paradigm for Research and Development to Facilitate Adoption and Use by Medical Experts

JMIR Med Inform 2021;9(9):e18471

DOI: 10.2196/18471

PMID: 34581670

PMCID: 8512184

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.