JMIR Preprints #56614: Redefining Health Care Data Interoperability: An Empirical Exploration of Large Language Models in Information Exchange

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Redefining Health Care Data Interoperability: An Empirical Exploration of Large Language Models in Information Exchange

Dukyong Yoon;
Changho Han;
Dong Won Kim;
Songsoo Kim;
SungA Bae

ABSTRACT

Background:

Efficient data exchange and health care interoperability are impeded by medical records often being in non-standardized or unstructured natural language format. Advanced language models, such as large language models (LLMs), may help overcome current challenges in information exchange.

Objective:

This study evaluates the capability of LLMs in transforming and transferring health care data to support interoperability.

Methods:

Utilizing data from the Medical Information Mart for Intensive Care III and UK Biobank, the study conducted three experiments. Experiment 1 assessed the accuracy of transforming structured lab results into unstructured format. Experiment 2 explored the conversion of diagnostic codes between the coding frameworks of the International Classification of Diseases, Ninth Revision, Clinical Modification and SNOMED Clinical Terms using a traditional mapping table and a text-based approach facilitated by the LLM ChatGPT. Experiment 3 focused on extracting targeted information from unstructured records that included comprehensive clinical information (discharge notes).

Results:

The text-based approach showed a high conversion accuracy in transforming lab results (Experiment 1) and an enhanced consistency in diagnostic code conversion, particularly for frequently used diagnostic names, compared with the traditional mapping approach (Experiment 2). In Experiment 3, the LLM showed a positive predictive value of 87.2% in extracting generic drug names.

Conclusions:

This study highlighted the potential role of LLMs in significantly improving health care data interoperability, demonstrated by their high accuracy and efficiency in data transformation and exchange. The LLMs hold vast potential for enhancing medical data exchange without complex standardization for medical terms and data structure. Clinical Trial: N/A

Citation

Please cite as:

Yoon D, Han C, Kim DW, Kim S, Bae S

Redefining Health Care Data Interoperability: Empirical Exploration of Large Language Models in Information Exchange

J Med Internet Res 2024;26:e56614

DOI: 10.2196/56614

PMID: 38819879

PMCID: 11179014

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jan 22, 2024

Date Accepted: Apr 27, 2024

Redefining Health Care Data Interoperability: An Empirical Exploration of Large Language Models in Information Exchange

ABSTRACT

Citation

Copyright