Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIRx Med

Date Submitted: Aug 26, 2020
Open Peer Review Period: Aug 26, 2020 - Sep 23, 2020
Date Accepted: Sep 30, 2020
Date Submitted to PubMed: Aug 4, 2023
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

A Machine Learning Explanation of the Pathogen-Immune Relationship of SARS-CoV-2 (COVID-19), and a Model to Predict Immunity and Therapeutic Opportunity: A Comparative Effectiveness Research Study

Luellen E

A Machine Learning Explanation of the Pathogen-Immune Relationship of SARS-CoV-2 (COVID-19), and a Model to Predict Immunity and Therapeutic Opportunity: A Comparative Effectiveness Research Study

JMIRx Med 2020;1(1):e23582

DOI: 10.2196/23582

PMID: 33711083

PMCID: 7924715

A machine learning explanation of the pathogen-immune relationship of SARS-CoV-2 (COVID-19), model to predict immunity, and therapeutic opportunities: A comparative effectiveness research study

  • Eric Luellen

ABSTRACT

Background:

Approximately 80% of those infected with COVID-19 are immune. They are asymptomatic unknown carriers who still can infect those with whom they come into contact. Understanding what makes them immune could inform public health policies as to who needs to be protected and why, and possibly lead to novel therapeutics for those who cannot, or will not, be vaccinated once a vaccine is available. The clinical impacts of this study are it: (1) identified three immunological factors that differentiate asymptomatic, or resistant, COVID-19 patients; (2) identified the levels of those factors that can be used by clinicians to predict who is likely to be asymptomatic or symptomatic; (3) identified a novel COVID-19 therapeutic for further testing; and, (4) ordinally ranked 34 common immunological factors by their importance in predicting disease severity.

Objective:

The primary objectives of this study were to learn if machine learning could identify patterns in the pathogen-host immune relationship that differentiate or predict COVID-19 symptom immunity and, if so, which ones and at what levels. The secondary objective was to learn if machine learning could take such differentiators to build a model that could predict COVID-19 immunity with clinical accuracy. The tertiary objective was to learn about the relevance of other immune factors. Design: This was a comparative effectiveness research study on 53 common immunological factors using machine learning on clinical data from 74 similarly-grouped Chinese COVID-19-positive patients, 37 of whom were symptomatic and 37 asymptomatic. Setting: A single-center primary-care hospital in the Wanzhou District of China. Participants: Immunological factors were measured in patients who were diagnosed as SARS-CoV-2 positive by reverse transcriptase-polymerase chain reaction (RT-PCR) in the 14 days before the recordation of the observations. The median age of the 37 asymptomatic patients was 41 years (range 8-75 years), 22 were female, 15 were male. For comparison, 37 RT-PCR test-positive patients were selected and matched to the asymptomatic group by age, comorbidities, and sex. Main Outcome: The primary study outcome was that asymptomatic COVID-19 patients could be identified by three distinct immunological factors and level: stem-cell growth factor-beta (SCGF-) (> 127637), interleukin-16 (IL-16) (> 45), and macrophage colony-stimulating factor (M-CSF) (> 57). The secondary study outcome was the novel suggestion that stem-cell therapy with SCGF- may be a new valuable therapeutic for COVID-19.

Methods:

This was a comparative effectiveness research study on 53 common immunological factors using machine learning on clinical data from 74 similarly-grouped Chinese COVID-19-positive patients, 37 of whom were symptomatic and 37 asymptomatic. The setting was a single-center primary-care hospital in the Wanzhou District of China. Immunological factors were measured in patients who were diagnosed as SARS-CoV-2 positive by reverse transcriptase-polymerase chain reaction (RT-PCR) in the 14 days before the recordation of the observations. The median age of the 37 asymptomatic patients was 41 years (range 8-75 years), 22 were female, 15 were male. For comparison, 37 RT-PCR test-positive patients were selected and matched to the asymptomatic group by age, comorbidities, and sex. Machine learning models were trained and compared to understand the pathogen-immune relationship and predict who was immune to COVID-19 and why using the statistical programming language R.

Results:

When SCGF- was included in the machine-learning analysis, a decision-tree and extreme gradient boosting algorithms classified and predicted COVID-19 symptoms immunity with 100% accuracy. When SCGF- was excluded, a random-forest algorithm classified and predicted COVID-19 asymptomatic and symptomatic cases with 94.8% area under the ROC curve accuracy (95% CI 90.17% to 100%). Thirty-four (34) common immune factors have statistically significant (P-value < .05) associations with COVID-19 symptoms and 19 immune factors appear to have no statistically significant association.

Conclusions:

People with an SCGF- level > 127637, or an IL-16 level > 45 and M-CSF level > 57, appear to be predictively immune to COVID-19, 100%, and 94.8% (ROC AUC) of the time, respectively. Testing levels of these three immunological factors may be a valuable tool at the point-of-care for managing and preventing outbreaks. Further, stem-cell therapy via SCGF- and/or M-CSF appear to be promising novel therapeutics for COVID-19.


 Citation

Please cite as:

Luellen E

A Machine Learning Explanation of the Pathogen-Immune Relationship of SARS-CoV-2 (COVID-19), and a Model to Predict Immunity and Therapeutic Opportunity: A Comparative Effectiveness Research Study

JMIRx Med 2020;1(1):e23582

DOI: 10.2196/23582

PMID: 33711083

PMCID: 7924715

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.