JMIR Preprints #53216: Investigating the Impact of Prompt Engineering on LLMs Performance for Standardizing Obstetric Diagnosis Text: A Comparative Study

Investigating the Impact of Prompt Engineering on LLMs Performance for Standardizing Obstetric Diagnosis Text: A Comparative Study

Lei Wang;
Wenshuai Bi;
Suling Zhao;
Yinyao Ma;
Longting Lv;
Chenwei Meng;
Jingru Fu;
Hanlin Lv

ABSTRACT

Background:

In obstetrics, electronic medical records capture crucial information about pregnant women, from pregnancy and delivery through postpartum recovery, and this information is vital for obstetrics-related research. However, the diagnostic descriptions in these records are diverse and unstandardized, which makes aggregating and analyzing data from multiple sources highly complex and poses challenges for medical research. Recent advances in ChatGPT demonstrate its proficiency in comprehending and generating human-like text, indicating significant potential for text extraction and standardization tasks in the medical domain.

Objective:

The study aims to utilize ChatGPT to mine and explore real-world obstetric data and to create a preliminary knowledge graph of obstetric diagnostic terminologies.

Methods:

To achieve our objective, we used a three-step approach. First, we extracted obstetric diagnostic descriptions from electronic medical records. Second, we applied a composite strategy that combined ChatGPT models with similarity-based methods for further processing. Third, we explored and validated four prompting techniques to identify the most effective approach for standardizing diagnostic descriptions.
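
The abstract does not specify how the ChatGPT models and similarity-based methods were combined. As a minimal sketch of one possible arrangement, the Python below matches a raw diagnostic description to a list of standard terms by string similarity and defers to a language model only when no term is similar enough. The function names, the 0.85 threshold, the use of `difflib` as the similarity measure, and the `llm_fallback` callable are all illustrative assumptions, not the authors' implementation.

```python
from difflib import SequenceMatcher
from typing import Callable, Optional, Sequence


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for any similarity metric."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def standardize(
    description: str,
    standard_terms: Sequence[str],
    llm_fallback: Optional[Callable[[str], str]] = None,
    threshold: float = 0.85,  # assumed cutoff, not taken from the study
) -> Optional[str]:
    """Map a raw description to a standard term, or defer to an LLM.

    Returns the closest standard term when its similarity reaches the
    threshold. Otherwise returns whatever llm_fallback produces, or None
    when no fallback is supplied.
    """
    if standard_terms:
        best_term = max(standard_terms, key=lambda t: similarity(description, t))
        if similarity(description, best_term) >= threshold:
            return best_term
    if llm_fallback is not None:
        # Hypothetical hook: in practice this would wrap a prompted model call.
        return llm_fallback(description)
    return None
```

Passing the model call in as a callable keeps the sketch runnable without network access and makes it straightforward to swap in different prompting techniques for comparison.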

Results:

The accuracy of our method was competitive with that of the BERT model, with an F1-score of 0.923. Using clustering algorithms, we categorized 1100 diagnostic terms into 107 distinct subcategories, which formed a preliminary knowledge graph.
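
For readers less familiar with the metric, the F1-score is the harmonic mean of precision and recall. The abstract does not state how the score was averaged across terms, so the following is only a minimal sketch of the standard definition computed from raw counts; the function name and the example counts are hypothetical and do not reproduce the study's data.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return F1 = 2 * P * R / (P + R), or 0.0 when it is undefined."""
    predicted = true_positives + false_positives  # everything the system labeled positive
    actual = true_positives + false_negatives     # everything that truly is positive
    if predicted == 0 or actual == 0:
        return 0.0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only.
example = f1_score(true_positives=8, false_positives=2, false_negatives=2)
```

With these hypothetical counts, precision and recall are both 0.8, so the F1-score is also 0.8.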

Conclusions:

Our study demonstrates the effectiveness of using ChatGPT to standardize obstetric diagnostic descriptions from real-world data. Standardizing diagnostic descriptions enhances the precision and efficiency of obstetric diagnosis. By creating a preliminary knowledge graph with distinct subcategories, we contribute to the effective standardization and classification of large-scale real-world medical data. Overall, our research aims to offer valuable information for future obstetric research.

Clinical Trial:

The study was approved by the People’s Hospital of the Guangxi Zhuang Autonomous Region in China (Ref. No. KT-KJT-2021-67), and registered in ChiCTR under identifier ChiCTR2300072225.

Citation

Please cite as:

Wang L, Bi W, Zhao S, Ma Y, Lv L, Meng C, Fu J, Lv H

Investigating the Impact of Prompt Engineering on the Performance of Large Language Models for Standardizing Obstetric Diagnosis Text: Comparative Study

JMIR Form Res 2024;8:e53216

DOI: 10.2196/53216

PMID: 38329787

PMCID: 10884897

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

Accepted for/Published in: JMIR Formative Research

Date Submitted: Sep 29, 2023

Date Accepted: Jan 11, 2024
