Large Language Models for Healthcare Text Classification: A Systematic Review
ABSTRACT
Background:
Large Language Models (LLMs) have fundamentally transformed approaches to Natural Language Processing (NLP) tasks across diverse domains. In healthcare, accurate and cost-efficient text classification is crucial for clinical notes analysis, diagnosis coding, and related tasks, and LLMs show considerable promise here. Text classification faces multiple challenges, including the cost of manual annotation for training, handling imbalanced data, and developing scalable approaches. Healthcare adds further challenges, notably the critical need to preserve patients' data privacy and the complexity of medical terminology. Existing systematic reviews of LLMs either do not specialize in text classification or do not focus on the healthcare domain.
Objective:
This review synthesizes and critically evaluates the current evidence in the literature on the use of LLMs for text classification in healthcare settings.
Methods:
Major databases (e.g., Google Scholar, Scopus, PubMed, ScienceDirect) and other resources were queried, focusing on papers published between 2018 and 2024 and following PRISMA guidelines. Studies were categorized by text classification type (e.g., binary classification, multi-label classification), application (e.g., clinical decision support, public health and opinion analysis), methodology, type of healthcare text, and metrics used for evaluation and validation.
Results:
The systematic review identified 65 eligible research articles that leveraged LLMs for automated healthcare text classification and contrasted their results with those of existing machine learning-based methods, which traditionally require embedding, annotation, and training.
Conclusions:
This review reveals gaps in the existing literature and suggests future research directions for LLMs in healthcare text classification.
Clinical Trial: N/A
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). The authors have granted JMIR Publications an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be published under a CC-BY license, at this stage the authors and publisher expressly prohibit redistribution of this draft other than for review purposes.