Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jan 3, 2024
Open Peer Review Period: Jan 11, 2024 - Mar 7, 2024
Date Accepted: May 20, 2024
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Creating a Modified Version of the Cambridge Multimorbidity Score to Predict Mortality in People Older Than 16 Years: Model Development and Validation

Kar D, Taylor KS, Joy M, Venkatesan S, Meeraus W, Taylor S, Anand S, Ferreira F, Jamie G, Fan X, de Lusignan S

Creating a Modified Version of the Cambridge Multimorbidity Score to Predict Mortality in People Older Than 16 Years: Model Development and Validation

J Med Internet Res 2024;26:e56042

DOI: 10.2196/56042

PMID: 39186368

PMCID: 11384182

Creating a Modified Version of the Cambridge Multimorbidity Score (CMMS) to Predict Mortality in People Over 16 Years in the English Nationwide General Practice Extraction Service Data for Pandemic Planning and Research (GDPPR) Dataset: Model Development and Validation

  • Debasish Kar; 
  • Kathryn Suzann Taylor; 
  • Mark Joy; 
  • Sudhir Venkatesan; 
  • Wilhelmine Meeraus; 
  • Sylvia Taylor; 
  • Sneha Anand; 
  • Filipa Ferreira; 
  • Gavin Jamie; 
  • Xuejuan Fan; 
  • Simon de Lusignan

ABSTRACT

Background:

No single multimorbidity measure is validated for use in NHS England's General Practice Extraction Service Data for Pandemic Planning and Research (GDPPR), the nationwide primary care dataset created for coronavirus disease 19 (COVID-19) pandemic research. A single morbidity measure is advantageous when there is a need to adjust for multimorbidity, such as modelling the effectiveness of vaccinations against COVID-19, as including multiple individual morbidities is challenging. The Cambridge Multimorbidity Score (CMMS) is a validated tool for predicting mortality risk. However, the number of Systematised Nomenclature of Medicine clinical terms (SNOMED CT) for the GDPPR dataset is limited and does not define all the conditions used to calculate the CMMS

Objective:

To develop and validate a modified version of CMMS using the clinical terms available for the GDPPR.

Methods:

We used pseudonymised data from the Oxford-Royal College of General Practitioners Research and Surveillance Centre (RCGP RSC), which has a more extensive SNOMED CT list. From the 37 conditions used in the original CMMS model, we selected conditions either with: (a) high prevalence ratio (≥ 85%), calculated as the prevalence in the RSC data set as defined by the GDPPR set of SNOMED CT codes, divided by the prevalence as defined by the RSC set of SNOMED CT codes, or (b) conditions with lower prevalence ratio but with high predictive value. The resulting set of conditions was included in Cox proportional hazard models to determine the 1-year mortality risk in a development dataset (n=300,000) and construct a new CMMS model, following the original CMMS, with variable reduction and parsimony, achieved by backward elimination and Akaike information stopping criterion. Model validation involved obtaining 1-year mortality estimates for a synchronous dataset (n=150,000) and 1-year and 5-year mortality estimates for an asynchronous dataset (n=150,000).

Results:

The initial model contained 22 conditions and our final model included 17 conditions. The conditions overlapped with those of a modified CMMS, which we previously developed using RSC data and the more extensive RSC SNOMED CT list. For 1-year mortality, discrimination was high in both the derivation and validation datasets (Harrell's C=0.92), and 5-year mortality was slightly lower (Harrell's C= 0.90), and the calibration was reasonable following an adjustment for over-fitting. The performance was similar to that of both the original and previous modified CMMS models.

Conclusions:

The modified version of the CMMS can be used on the GDPPR, a nationwide primary care dataset of 54 million people, to predict mortality in people in real-world vaccine effectiveness, pandemic planning, and other research studies. It requires 17 variables to produce a comparable performance with our previous modification of CMMS to enable it to be used in routine data using SNOMED CT.


 Citation

Please cite as:

Kar D, Taylor KS, Joy M, Venkatesan S, Meeraus W, Taylor S, Anand S, Ferreira F, Jamie G, Fan X, de Lusignan S

Creating a Modified Version of the Cambridge Multimorbidity Score to Predict Mortality in People Older Than 16 Years: Model Development and Validation

J Med Internet Res 2024;26:e56042

DOI: 10.2196/56042

PMID: 39186368

PMCID: 11384182

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.