Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Sep 19, 2023
Date Accepted: Aug 17, 2024

The final, peer-reviewed published version of this preprint can be found here:

Multifaceted Natural Language Processing Task–Based Evaluation of Bidirectional Encoder Representations From Transformers Models for Bilingual (Korean and English) Clinical Notes: Algorithm Development and Validation

Kim K, Park S, Min J, Park S, Kim J, Eun J, Jung K, Park YE, Kim E, Lee E, Lee J, Choi J

JMIR Med Inform 2024;12:e52897

DOI: 10.2196/52897

PMID: 39475725

PMCID: 11539635

Multifaceted NLP task-based evaluation of BERT models for bilingual (Korean and English) clinical notes: Comparative Analysis

  • Kyungmo Kim
  • Seongkeun Park
  • Jeongwon Min
  • Sumin Park
  • Juyeon Kim
  • Jinsu Eun
  • Kyuha Jung
  • Yoobin Elyson Park
  • Esther Kim
  • Eunyoung Lee
  • Joonhwan Lee
  • Jinwook Choi

ABSTRACT

Background:

The Bidirectional Encoder Representations from Transformers (BERT) model has gained widespread use in clinical applications, such as patient classification and disease prediction. However, prior studies have two limitations. First, they often emphasized application development without thoroughly assessing the model's comprehension of clinical context. Second, comparative research on BERT models using medical documents from non-English-speaking countries has been lacking, raising concerns about whether BERT models trained on English clinical notes apply to non-English contexts. To address these gaps, our study aimed to identify the most effective BERT model for non-English clinical notes.

Objective:

This study sought to evaluate the contextual understanding abilities of various BERT models when applied to mixed Korean and English clinical notes. Our primary objective was to identify the BERT model that excels in understanding the context of such documents.

Methods:

Using data from 164,460 patients at a tertiary hospital in South Korea, we compared BERT-base, BERT for Biomedical Text Mining (BioBERT), Korean BERT (KoBERT), and Multilingual BERT (M-BERT). We pretrained these models to improve their contextual comprehension and then compared them on seven distinct fine-tuning tasks.

Results:

Model performance varied with the task and token usage. First, BERT-base and BioBERT excelled in tasks that use [CLS] token embeddings, such as document classification, demonstrating their effectiveness in document pattern recognition even with limited Korean tokens in the dictionary. Second, M-BERT exhibited superior performance in reading comprehension (RC) tasks, where results were better when fewer words were replaced with [UNK] tokens. Third, M-BERT excelled in the knowledge inference task, in which it effectively inferred the correct disease names from 63 candidates when given a document whose disease names had been replaced with [MASK] tokens.
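The link between vocabulary coverage and [UNK] replacement mentioned above can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes a simple whitespace tokenizer instead of the WordPiece subword tokenization that BERT models actually use, and the function name `unk_rate`, the toy vocabularies, and the sample note are all hypothetical.

```python
def unk_rate(text, vocab, unk_token="[UNK]"):
    """Return the fraction of whitespace-separated tokens missing from vocab.

    Assumption: whitespace tokenization stands in for a real subword
    tokenizer, so the numbers are illustrative only.
    """
    tokens = text.split()
    if not tokens:
        return 0.0
    # Map every out-of-vocabulary token to the unknown-token placeholder.
    mapped = [tok if tok in vocab else unk_token for tok in tokens]
    return mapped.count(unk_token) / len(mapped)


# Hypothetical toy vocabularies: one English-only, one covering both languages.
english_only_vocab = {"patient", "with", "fever"}
mixed_vocab = english_only_vocab | {"환자", "발열"}

# Hypothetical mixed Korean and English note fragment.
note = "환자 patient with 발열 fever"

rate_english_only = unk_rate(note, english_only_vocab)
rate_mixed = unk_rate(note, mixed_vocab)
print(rate_english_only, rate_mixed)
```

In this toy case, the English-only vocabulary loses both Korean words to the placeholder, while the mixed vocabulary keeps every token. A real comparison would need each model's own tokenizer and vocabulary.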

Conclusions:

This study highlights the effectiveness of different BERT models in a multilingual clinical domain. We anticipate that our findings will significantly benefit researchers working in the clinical field or conducting language-based investigations.




© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC-BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.