Accepted for/Published in: JMIR AI
Date Submitted: Nov 9, 2023
Open Peer Review Period: Nov 9, 2023 - Feb 2, 2024
Date Accepted: Mar 30, 2024
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Near-real time syndromic surveillance of Emergency Department triage text: A Natural Language Processing (NLP) approach
ABSTRACT
Background:
Collecting information on adverse events following immunization from as many sources as possible is critical for prompt identification of any potential safety concerns, so that appropriate actions can be taken. Febrile convulsions are recognized as an important potential reaction to vaccination in children under the age of 6.
Objective:
The primary aim of this study is to evaluate natural language processing (NLP) techniques and machine learning models for the rapid detection of febrile convulsion presentations in emergency departments Additionally, we examine the deployment requirements for a machine learning model to perform real-time monitoring of emergency department triage notes.
Methods:
We developed a pattern matching approach as a baseline and evaluated several machine learning models for the classification of febrile convulsions in emergency department (ED triage notes, to determine both their training requirements and their effectiveness in detecting febrile convulsions. We measured their performance during training and then compared the deployed models’ result on new incoming ED data.
Results:
Although the best standard neural networks had acceptable performance and were low resource models, transformer-based models outperformed them significantly, justifying their ongoing deployment.
Conclusions:
Employing NLP, particularly with the utilization of large language models, offers significant advantages in syndromic surveillance. Large language models make highly effective classifiers, and their text generation capacity can be used to enhance the quality and diversity of training data.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.