Accepted for/Published in: JMIR Formative Research
Date Submitted: Aug 24, 2025
Date Accepted: Feb 18, 2026
Ontology-based Medication Extraction: Named Entity Recognition using Pre-trained Transformer Models from a Thai Hospital
ABSTRACT
Background:
Extracting accurate medication information from Thai hospital records presents challenges due to the narrative style of medical notes, which often combine Thai and English terminology. This study aimed to address these difficulties by leveraging ontology-based named entity recognition (NER) and pre-trained transformer models.
Objective:
The primary objective was to investigate the effectiveness of ontology-based NER and pre-trained transformer models in extracting medication information from unstructured Thai hospital records, thereby improving data standardization and interoperability in Thai healthcare.
Methods:
An annotated dataset comprising 90 discharge summaries was developed, based on SNOMED-CT and FHIR standards for medical terminology. Three deep learning models—BioClinicalBERT, ClinicalBERT, and Microsoft BiomedNLP—were trained and subsequently evaluated on this dataset to assess their performance in entity recognition.
Results:
Among the models tested, ClinicalBERT demonstrated the highest overall F1-score in entity recognition. It performed particularly well at identifying drug substances and dosage entity types.
Conclusions:
The findings suggest that ontology-based medication information extraction using transformer-based models holds significant promise for enhancing data standardization and interoperability within the Thai healthcare system. This approach offers a viable solution for overcoming the complexities of unstructured Thai medical notes.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.