Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Sep 18, 2023
Date Accepted: Oct 12, 2023

The final, peer-reviewed published version of this preprint can be found here:

The Impact of Multimodal Large Language Models on Health Care’s Future

Meskó B

The Impact of Multimodal Large Language Models on Health Care’s Future

J Med Internet Res 2023;25:e52865

DOI: 10.2196/52865

PMID: 37917126

PMCID: 10654899

The Impact of Multimodal Large Language Models on Healthcare’s Future

  • Bertalan Meskó

ABSTRACT

When large language models (LLMs) were introduced to the public at large in late 2022 with ChatGPT, the interest was unprecedented with more than 1 billion unique users within 90 days. Until the introduction of GPT-4 in March 2023, these LLMs were single mode—text. As medicine is a multimodal discipline, the potential future versions of LLMs that can handle multimodality — meaning they could interpret and generate not only text but also images, videos, sound, and even comprehensive documents — can be conceptualized as a significant evolution in the field of AI. This paper zooms in on the new potential of generative artificial intelligence (AI) by the achievement of multimodal inputs of text, images, and speech. We present several futuristic patient scenarios to help illustrate the potential path forward. It is important to point out though that despite the unprecedented potential of generative AI in the form of M-LLMs, the human touch in medicine remains irreplaceable. AI should be seen as a tool that can augment healthcare professionals, rather than replace them. It's also important to consider the human aspects of healthcare - empathy, understanding, and the doctor-patient relationship - when deploying AI.


 Citation

Please cite as:

Meskó B

The Impact of Multimodal Large Language Models on Health Care’s Future

J Med Internet Res 2023;25:e52865

DOI: 10.2196/52865

PMID: 37917126

PMCID: 10654899

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.