Accepted for/Published in: JMIR Bioinformatics and Biotechnology
Date Submitted: Aug 22, 2023
Open Peer Review Period: Aug 21, 2023 - Oct 16, 2023
Date Accepted: Dec 3, 2023
(closed for review but you can still tweet)
Machine Learning for Prediction of Maternal Hemorrhage and Transfusion
ABSTRACT
Background:
Current postpartum hemorrhage (PPH) risk stratification is based on traditional statistical models or expert opinion. Machine learning could optimize PPH prediction by allowing for more complex modeling.
Objective:
We sought to improve PPH prediction and compare machine learning and traditional statistical methods.
Methods:
We developed models using the Consortium for Safe Labor dataset (2002-2008) from 12 US hospitals. The primary outcome was a transfusion of blood products or PPH (estimated blood loss ≥1,000mL). The secondary outcome was a transfusion of any blood products. Fifty antepartum and intrapartum characteristics and hospital characteristics were included. Logistic regression, support vector machines, multi-layer perceptron, random forest, and gradient boosting were used to generate prediction models. The receiver operating characteristic area under the curve (ROC-AUC) and precision/recall area under the curve (PR-AUC) were used to compare performance.
Results:
Among 228,438 births, 5,760 women (3.1%) had a postpartum hemorrhage, 5,170 women (2.8%) had a transfusion, and 10,344 women (5.6%) met the criteria for the transfusion-PPH composite. Models predicting transfusion-PPH composite using antepartum and intrapartum features had the best positive predictive values with the gradient boosting machine learning model performing best overall (ROC-AUC=0.833, 95% CI [0.828-0.838]; PR-AUC=0.210 95% CI [0.201-0.220]). The most predictive features in the gradient boosting model predicting transfusion-PPH composite were the mode of delivery, oxytocin incremental dose for labor(mU/min), intrapartum tocolytic use, presence of anesthesia nurse, and hospital type.
Conclusions:
Machine learning offers higher discrimination than logistic regression in predicting PPH. The CSL dataset may not be optimal for analyzing risk due to strong subgroup effects, which decreases accuracy and limits generalizability.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.