JMIR Preprints #19200: Use of a machine learning program to correctly triage incoming SMS text replies from a cardiovascular text-based secondary prevention program: a feasibility study

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Use of a machine learning program to correctly triage incoming SMS text replies from a cardiovascular text-based secondary prevention program: a feasibility study

Nicole Lowres;
Andrew Duckworth;
Julie Refern;
Aravinda Thiagalingam;
Clara K Chow

ABSTRACT

Background:

SMS text message programs are increasingly being used for secondary prevention, and have been shown to be effective in a number of health conditions including cardiovascular disease. Text programs have the potential to increase the reach of an intervention to larger numbers of people who may not access traditional programs, at an affordable cost to the health system. However, patients regularly reply to the text messages, leading to additional staffing requirements to monitor and moderate the patients’ text replies. This additional staff requirement directly impacts the cost effectiveness and scalability of text interventions.

Objective:

This study aimed to test the feasibility and accuracy of developing a machine learning (ML) program to ‘triage’ text message replies (i.e. identify which text message replies require a health professional review).

Methods:

Text message replies received from two clinical trials were manually coded firstly into ‘is staff review required?’ (binary response of yes/no); and secondly into 12-general categories. Five ML models (‘Naïve Bayes’, ‘OneVsRest’, ‘Random Forest decision trees’, ‘Gradient Boosted Trees’, ‘Multi-Layer Perceptron’) and an ensemble model were tested. For each model-run, data were randomly allocated into training-set (70%) and test-set (30%). Accuracy for yes/no classification was calculated using area under curve [AUC], false positives, and false negatives. Accuracy for classification into 12-categories was compared using multi-class classification evaluators.

Results:

Manual review of 3,118 text replies showed 22% required staff review. For determining need for staff review, the Multi-Layer Perceptron model had highest accuracy (area under curve [AUC] 0.86; 4.85% false negatives; 4.63% false positives); with addition of ‘heuristics’ (specified keywords) fewer false negatives were identified (3.19%) with small increase in false positives (7.66%) and AUC 0.79. Application of this model would result in 26.7% of text replies requiring review (true + false positives). The ensemble model produced the lowest false negatives (1.43%) at expense of higher false positives (16.19%). OneVsRest was most accurate (72.3%) for the 12-category classification.

Conclusions:

The ML program has high sensitivity for identifying the text replies requiring staff input. Incorporation of a ML program to review text replies could significantly reduce staff workload, as staff would not have to review all incoming texts. This could lead to substantial improvements in cost-effectiveness, scalability and capacity of text-based interventions.

Citation

Please cite as:

Lowres N, Duckworth A, Refern J, Thiagalingam A, Chow CK

Use of a Machine Learning Program to Correctly Triage Incoming Text Messaging Replies From a Cardiovascular Text–Based Secondary Prevention Program: Feasibility Study

JMIR Mhealth Uhealth 2020;8(6):e19200

DOI: 10.2196/19200

PMID: 32543439

PMCID: 7327598

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR mHealth and uHealth

Date Submitted: Apr 7, 2020

Date Accepted: May 13, 2020

Use of a machine learning program to correctly triage incoming SMS text replies from a cardiovascular text-based secondary prevention program: a feasibility study

ABSTRACT

Citation

Copyright