Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Jun 16, 2025
Date Accepted: Dec 29, 2025

The final, peer-reviewed published version of this preprint can be found here:

Prediction of First and Multiple Antiretroviral Therapy Interruptions in People Living With HIV: Comparative Survival Analysis Using Cox and Explainable Machine Learning Models

Salami D, Koech E, Turan JM, Stafford KA, Nyagah LM, Ohakanu S, Ngugi AK, Charurat M

Prediction of First and Multiple Antiretroviral Therapy Interruptions in People Living With HIV: Comparative Survival Analysis Using Cox and Explainable Machine Learning Models

JMIR Med Inform 2026;14:e78964

DOI: 10.2196/78964

PMID: 41637488

PMCID: 12871577

Comparison of Traditional Cox Model and Explainable Machine Learning Models in Predicting First and Multiple Treatment Interruptions in Antiretroviral Therapy

  • Donald Salami; 
  • Emily Koech; 
  • Janet M. Turan; 
  • Kristen A. Stafford; 
  • Lilly Muthoni Nyagah; 
  • Stephen Ohakanu; 
  • Anthony K. Ngugi; 
  • Manhattan Charurat

ABSTRACT

Background:

The Cox proportional hazards (CPH) model is a common choice for analyzing time to treatment interruptions in patients on antiretroviral therapy (ART), valued for its straightforward interpretability and flexibility in handling time-dependent covariates. Machine learning (ML) models have increasingly been adapted for handling temporal data, with added advantages of handling complex, non-linear relationships, large datasets, and provide clear practical interpretations.

Objective:

This study aims to compare the predictive performance of the traditional CPH model and ML models in predicting treatment interruptions among patients on ART, while also providing both global and individual-level explanations to support personalized, data-driven interventions for improving treatment retention.

Methods:

Using data from 621,115 patients who started ART between 2017 and 2023, in Kenya, we compared the performance of the CPH with 6 different ML models – Gradient Boosting Machine, Extreme Gradient Boosting, Regularized Generalized Linear Models (Ridge, Lasso and Elastic-Net) and Recursive Partitioning – in predicting first and multiple treatment interruptions. Explainable surrogate technique (model-agnostic) was applied to interpret the best-performing model's predictions globally, using variable importance and partial dependence profiles (PDP), and at individual-level, using break-down additive (BD), Shapley additive explanations (SHAP), and ceteris paribus (CP).

Results:

Recursive partitioning (RP) model achieved the best performance with a predictive concordance index score (C-Index) of 0.81 for first treatment interruptions and 0.89 for multiple interruptions, outperforming the CPH model, which scored 0.78 and 0.87 for the same scenarios, respectively. RP’s performance can be attributed to its ability to model non-linear relationships and automatically detect complex interactions. The global model-agnostic explanations aligned closely with the interpretations offered by hazard ratios in the CPH model, while offering additional insights into the impact of specific features on the model's predictions. The BD and SHAP explainers demonstrated how different variables contribute to the predicted risk at the individual patient level. The CP profiles further explored the time-varying model, to illustrate how changes in a patient’s covariates over time could impact their predicted risk of treatment interruption.

Conclusions:

In conclusion, our results highlight the superior predictive performance of ML models and their ability to provide patient-specific risk predictions and insights that can support targeted interventions to reduce treatment interruptions in antiretroviral therapy care.


 Citation

Please cite as:

Salami D, Koech E, Turan JM, Stafford KA, Nyagah LM, Ohakanu S, Ngugi AK, Charurat M

Prediction of First and Multiple Antiretroviral Therapy Interruptions in People Living With HIV: Comparative Survival Analysis Using Cox and Explainable Machine Learning Models

JMIR Med Inform 2026;14:e78964

DOI: 10.2196/78964

PMID: 41637488

PMCID: 12871577

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.