Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Aug 24, 2025
Date Accepted: Feb 18, 2026

The final, peer-reviewed published version of this preprint can be found here:

Ontology-Based Medication Named Entity Recognition Using Pretrained Transformer Models From a Thai Hospital: Model Fine-Tuning and Validation Study

Isaradech N, Sirikul W, Schulz S, Kreuzthaler ME

Ontology-Based Medication Named Entity Recognition Using Pretrained Transformer Models From a Thai Hospital: Model Fine-Tuning and Validation Study

JMIR Form Res 2026;10:e82685

DOI: 10.2196/82685

PMID: 41861368

Ontology-based Medication Extraction: Named Entity Recognition using Pre-trained Transformer Models from a Thai Hospital

  • Natthanaphop Isaradech; 
  • Wachiranun Sirikul; 
  • Stefan Schulz; 
  • Markus Eduard Kreuzthaler

ABSTRACT

Background:

Extracting accurate medication information from Thai hospital records presents challenges due to the narrative style of medical notes, which often combine Thai and English terminology. This study aimed to address these difficulties by leveraging ontology-based named entity recognition (NER) and pre-trained transformer models.

Objective:

The primary objective was to investigate the effectiveness of ontology-based NER and pre-trained transformer models in extracting medication information from unstructured Thai hospital records, thereby improving data standardization and interoperability in Thai healthcare.

Methods:

An annotated dataset comprising 90 discharge summaries was developed, based on SNOMED-CT and FHIR standards for medical terminology. Three deep learning models—BioClinicalBERT, ClinicalBERT, and Microsoft BiomedNLP—were trained and subsequently evaluated on this dataset to assess their performance in entity recognition.

Results:

Among the models tested, ClinicalBERT demonstrated the highest overall F1-score in entity recognition. It performed particularly well at identifying drug substances and dosage entity types.

Conclusions:

The findings suggest that ontology-based medication information extraction using transformer-based models holds significant promise for enhancing data standardization and interoperability within the Thai healthcare system. This approach offers a viable solution for overcoming the complexities of unstructured Thai medical notes.


 Citation

Please cite as:

Isaradech N, Sirikul W, Schulz S, Kreuzthaler ME

Ontology-Based Medication Named Entity Recognition Using Pretrained Transformer Models From a Thai Hospital: Model Fine-Tuning and Validation Study

JMIR Form Res 2026;10:e82685

DOI: 10.2196/82685

PMID: 41861368

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.