Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Oct 2, 2024
Open Peer Review Period: Sep 30, 2024 - Oct 11, 2024
Date Accepted: Aug 26, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

A Machine Learning–Based Scoring System to Identify High Immunoactivity Microsatellite Stability Tumors by Quantifying Similarity to Microsatellite Instability-High Tumors in Colorectal Cancers: Development and Quantitative Study

Yan H, Jiang L, Li Y, Wang F, Mo S, Sheng W, Huang D, Peng J

A Machine Learning–Based Scoring System to Identify High Immunoactivity Microsatellite Stability Tumors by Quantifying Similarity to Microsatellite Instability-High Tumors in Colorectal Cancers: Development and Quantitative Study

JMIR Form Res 2025;9:e66960

DOI: 10.2196/66960

PMID: 41100766

PMCID: 12530644

A Machine Learning-Based Scoring System to Identify High Immunoactivity MSS Tumours by Quantifying Similarity to MSI-H Tumours in Colorectal Cancers

  • Hongkai Yan; 
  • Li Jiang; 
  • Yaqi Li; 
  • Fengchong Wang; 
  • Shaobo Mo; 
  • Weiqi Sheng; 
  • Dan Huang; 
  • Junjie Peng

ABSTRACT

Background:

Microsatellite stability (MSS) colorectal cancers (CRCs) have a limited response to immune checkpoint inhibitors (ICIs) compared to microsatellite instability-high (MSI-H) CRCs. Nevertheless, previous studies have shown that some MSS CRCs are sensitive to immune checkpoint inhibitors (ICIs), although established criteria for treatment justification are still lacking.

Objective:

We aimed to test the TIL features of MSS and develop a novel computational tool for the similarity prediction between MSS and MSI-H status in CRC patients based on multiple factors.

Methods:

Data from 188 CRC patients, including MSI status, immune cell distributions, clinical features, and gene mutations, were collected and analysed using statistical methods and Cox regression. An ensemble machine learning-based MSI-H score was developed using stacked XGBoost classifiers to quantify the similarity of patient data to MSI-H data based on immune cell distributions, clinical features, and gene mutations. The model is robust and can address missing input data for immune cell distributions and gene mutations.

Results:

The scorer performed well (Mean Cohen's kappa value = .42) in identifying MSI-H-like MSS samples with TIL distributions similar to genuine MSI-H CRCs. No significant difference was observed between the TIL features of MSI-H-like MSS CRCs and MSI-H CRCs. The disparity between MSI-H-like MSS CRCs and MSS CRCs potentially lies in the T regulatory cells (P = .094) and macrophage (P = .156) populations within the tumour stromal region.

Conclusions:

Part of the MSS CRC patients presented similar immune cell distributions with high immunoactivity compared to MSI-H patients. The MSI-H score serves as a metric to quantify the similarity of MSS CRCs to MSI-H CRCs, and presents a promising avenue for more personalized and effective cancer immunotherapy treatment, offering a clinical reference for potential ICI targets in MSS CRCs.


 Citation

Please cite as:

Yan H, Jiang L, Li Y, Wang F, Mo S, Sheng W, Huang D, Peng J

A Machine Learning–Based Scoring System to Identify High Immunoactivity Microsatellite Stability Tumors by Quantifying Similarity to Microsatellite Instability-High Tumors in Colorectal Cancers: Development and Quantitative Study

JMIR Form Res 2025;9:e66960

DOI: 10.2196/66960

PMID: 41100766

PMCID: 12530644

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.