Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Sep 2, 2023
Date Accepted: Apr 19, 2024
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Potentials of Large Language Models in Healthcare: A Delphi Study
ABSTRACT
Background:
A large language model (LLM) is a type of artificial intelligence (AI) algorithm that uses deep learning techniques and massively large data sets to understand, summarize, generate and predict new content. Modern LLMs use transformer-based models or short "transformers", which are neural networks and have been tested for various tasks in Natural Language Processing (NLP). In late 2022, such models gained widespread awareness with the release of ChatGPT which uses generative pre-trained transformer (GPT) models.
Objective:
The aim of this adapted Delphi study was to gain insights into opinions of how researchers think LLMs might influence healthcare and what are the strengths, weaknesses, opportunities and threats (SWOT) of the use of LLMs in healthcare.
Methods:
We invited researchers in the field of health informatics, nursing informatics, and medical NLP to share their opinions on the use of LLMs in healthcare. We started the first round with open questions based on our SWOT framework. In the second and third round, the participants scored these items.
Results:
The first, second, and third rounds had 28, 23, and 21 participants, respectively. Almost all participants were affiliated with academic institutions. Agreement was reached on 103 items related to use cases, benefits, risks, reliability, adoption aspects, and the future of LLMs in healthcare. Participants offered a multitude of use cases showing the potential value of LLMs; however, many shortcomings were also identified.
Conclusions:
Future research related to LLMs should not only focus on testing their possibilities for natural language related tasks, but also should consider the workflows the methods could contribute to and the requirements regarding quality, integration, and regulations needed for successful implementation in practice.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.