Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Sep 2, 2023
Date Accepted: Apr 19, 2024
Potentials of Large Language Models in Healthcare: A Delphi Study
ABSTRACT
Background:
A large language model (LLM) is a machine learning model inferred from text data that captures subtle patterns of language use in context. Modern LLMs are based on neural network architectures incorporating transformer methods, which allow the model to relate words to one another through attention over multiple words in a text sequence [1]. LLMs have been shown to be highly effective for a range of tasks in Natural Language Processing (NLP), including classification and information extraction as well as generative applications such as machine translation and summarization. In late 2022, the public gained widespread awareness of such models with the release of the Artificial Intelligence system ChatGPT, which is based on a generative pre-trained transformer (GPT) model.
Objective:
The aim of this adapted Delphi study was to gain insights into how researchers think LLMs might influence healthcare, and into the strengths, weaknesses, opportunities, and threats (SWOT) of the use of LLMs in healthcare.
Methods:
We invited researchers in the fields of health informatics, nursing informatics, and medical NLP to share their opinions on the use of LLMs in healthcare. We started the first round with open questions based on our SWOT framework. In the second and third rounds, the participants scored these items.
Results:
The first, second, and third rounds had 28, 23, and 21 participants, respectively. Almost all participants were affiliated with academic institutions. Agreement was reached on 103 items related to use cases, benefits, risks, reliability, adoption aspects, and the future of LLMs in healthcare. Participants proposed several use cases, including support for clinical tasks, documentation tasks, and medical research and education, and agreed that LLM-based systems will act as virtual health assistants for patient education. The agreed benefits included increased efficiency in data handling and extraction, improved automation of processes, improved quality of healthcare services and overall health outcomes, provision of personalized care, accelerated diagnosis and treatment processes, and improved interaction between patients and healthcare professionals. Five risks to healthcare in general were identified: cybersecurity breaches, the potential for patient misinformation, ethical concerns, the likelihood of biased decision making, and the risk associated with inaccurate communication. Overconfidence in LLM-based systems was recognized as a risk for the medical profession. The six agreed privacy risks included the use of unregulated cloud services that compromise data security, exposure of sensitive patient data, breaches of confidentiality, fraudulent use of information, vulnerabilities in data storage and communication, and inappropriate access or use of patient data.
Conclusions:
Future research related to LLMs should not only focus on testing their capabilities for NLP-related tasks, but should also consider the workflows these methods could contribute to and the requirements regarding quality, integration, and regulation needed for successful implementation in practice.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC-BY license on publication, at this stage the authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.