Accepted for/Published in: JMIR Formative Research

Date Submitted: Sep 29, 2023
Date Accepted: Jan 11, 2024

The final, peer-reviewed published version of this preprint can be found here:

Wang L, Bi W, Zhao S, Ma Y, Lv L, Meng C, Fu J, Lv H

Investigating the Impact of Prompt Engineering on the Performance of Large Language Models for Standardizing Obstetric Diagnosis Text: Comparative Study

JMIR Form Res 2024;8:e53216

DOI: 10.2196/53216

PMID: 38329787

PMCID: 10884897

Investigating the Impact of Prompt Engineering on LLMs Performance for Standardizing Obstetric Diagnosis Text: A Comparative Study

  • Lei Wang
  • Wenshuai Bi
  • Suling Zhao
  • Yinyao Ma
  • Longting Lv
  • Chenwei Meng
  • Jingru Fu
  • Hanlin Lv

ABSTRACT

Background:

In obstetrics, electronic medical records capture crucial information about pregnant women, from pregnancy and delivery through postpartum recovery. This information is vital for obstetrics-related research. However, the diagnostic descriptions in electronic medical records are diverse and unstandardized, which makes aggregating and analyzing data from multiple sources highly complex and poses challenges for medical research. Recent advances in ChatGPT showcase its proficiency in comprehending and generating human-like text, indicating significant potential for text extraction and standardization tasks in the medical domain.

Objective:

The study aims to utilize ChatGPT to mine and explore real-world obstetric data and to create a preliminary knowledge graph of obstetric diagnostic terminologies.

Methods:

To achieve our objective, we employed a three-step approach. First, we extracted obstetric diagnostic descriptions from electronic medical records. Next, we implemented a composite strategy that integrated ChatGPT models and similarity-based methods for further processing. Finally, we explored and validated four prompting techniques to identify the most effective approach for standardizing diagnostic descriptions.
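The abstract does not specify which similarity measure was used, so the following is only a minimal sketch of what a similarity-based matching step might look like. It maps a free-text diagnosis to the closest entry in a standard vocabulary using a character-level ratio from Python's standard library. The term list, the 0.8 threshold, and the names `STANDARD_TERMS`, `similarity`, and `match_term` are all hypothetical and are not taken from the study.

```python
from difflib import SequenceMatcher

# Hypothetical standard vocabulary; the study's actual term list
# is not given in the abstract.
STANDARD_TERMS = [
    "gestational diabetes mellitus",
    "premature rupture of membranes",
    "postpartum hemorrhage",
]


def similarity(a: str, b: str) -> float:
    """Case-insensitive character-level similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match_term(raw: str, threshold: float = 0.8):
    """Return (best_term, score); best_term is None below the threshold."""
    best = max(STANDARD_TERMS, key=lambda term: similarity(raw, term))
    score = similarity(raw, best)
    return (best if score >= threshold else None), score


if __name__ == "__main__":
    # A misspelled description still resolves to a standard term.
    print(match_term("Gestational diabetes melitus"))
    # An unrelated description falls below the assumed threshold.
    print(match_term("twin pregnancy"))
```

In a composite strategy of the kind described, descriptions that return `None` here could presumably be handed to a language model for standardization, although how the two components are actually combined is not stated in the abstract.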

Results:

Our experiments yielded promising results. Our method was competitive with the BERT model, achieving an F1-score of 0.923. We also categorized 1100 diagnostic terms into 107 distinct subcategories using clustering algorithms, forming a preliminary knowledge graph.
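For readers unfamiliar with the metric, the F1-score is the harmonic mean of precision and recall. The sketch below shows the arithmetic from raw counts; the counts in the usage example are illustrative only and are not the study's data, and the function name `f1_score` is a hypothetical helper.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Compute F1 from true positives, false positives, and false negatives."""
    if tp == 0:
        # No correct predictions: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Illustrative counts only: precision = recall = 0.8.
    print(f1_score(tp=8, fp=2, fn=2))
```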

Conclusions:

Our study demonstrates the effectiveness of utilizing ChatGPT for standardization of obstetric diagnostic descriptions from real-world data. Standardizing diagnostic descriptions enhances the precision and efficiency of obstetric diagnosis. By creating a preliminary knowledge graph with distinct subcategories, we contribute to the effective standardization and classification of large-scale real-world medical data. Overall, our research aims to offer valuable information for future obstetric research.

Clinical Trial:

The study was approved by the People’s Hospital of the Guangxi Zhuang Autonomous Region in China (Ref. No. KT-KJT-2021-67), and registered in ChiCTR under identifier ChiCTR2300072225.


© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.