Accepted for/Published in: JMIR Medical Informatics
Date Submitted: May 21, 2021
Date Accepted: Jan 2, 2022
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Selective Prediction LSTM for Time Series Health Datasets using Unit-wise Batch Standardization: Algorithm Development and Validation
ABSTRACT
Background:
In any healthcare system, both the classification of data and the confidence level of the classification are important. A selective prediction model is therefore needed to classify time-series health data according to confidence levels of prediction.
Objective:
The aim of this study is to develop a method using Long short-term memory (LSTM) models with reject option for time-series health data classification.
Methods:
To implement a reject option of classification output in LSTM models, an existing selective prediction method was adopted. However, a conventional selection function approach to LSTM does not achieve acceptable performance at the learning stage. To tackle this problem, we propose unit-wise batch standardization (UBS), which attempts to normalize each hidden unit in LSTM to reflect the structural characteristics of LSTM with respect to selection function.
Results:
From the results, the ability of our method to approximate the target confidence level was compared by coverage violations for two time series health datasets consisting of human activity and arrhythmia. For both datasets, our approach yielded lower average coverage violations (0.98% and 1.79% for each dataset) than conventional approach. In addition, the classification performance using the reject option was compared with other normalization methods. Our method demonstrates superior performance with respect to selective risk (12.63% and 17.82% for each dataset), false-positive rates (2.09% and 5.80% for each dataset), and false-negative rates (10.58% and 17.24% for each dataset).
Conclusions:
We conclude that our normalization approach can help make selective predictions for time-series health data. We expect this technique will give users more confidence in classification systems and improve collaborative efforts between human and artificial intelligence levels in the medical field through the use of classification that reflects confidence.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.