Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Cancer

Date Submitted: Apr 25, 2024
Open Peer Review Period: Apr 24, 2024 - Jun 19, 2024
Date Accepted: Nov 27, 2024
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening Among Populations in the United States: Machine Learning Approach

Hashtarkhani S, Zhou Y, Kumsa FA, White-Means S, Schwartz DL, Shaban-Nejad A

Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening Among Populations in the United States: Machine Learning Approach

JMIR Cancer 2025;11:e59882

DOI: 10.2196/59882

PMID: 39819978

PMCID: 11756836

Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening among Populations in the United States: A Machine Learning Approach

  • Soheil Hashtarkhani; 
  • Yiwang Zhou; 
  • Fekede Asefa Kumsa; 
  • Shelley White-Means; 
  • David L Schwartz; 
  • Arash Shaban-Nejad

ABSTRACT

Background:

Breast cancer screening plays a pivotal role in early detection and subsequent effective management of the disease, significantly impacting patient outcomes and survival rates.

Objective:

This study aims to assess breast cancer screening rates nationwide in the United States (U.S.) and investigate the impact of social determinants of health on these screening rates

Methods:

Data on mammography screening for the years 2018 and 2020 at the census tract level were collected from the Behavioral Risk Factor Surveillance System (BRFSS). We developed a large-scale dataset of social determinants of health with 13 variables for 72,337 census tracts. Spatial analysis, employing Getis-Ord Gi statistics, was used to identify clusters of high and low breast cancer screening rates. Next, we implemented a random forest (RF) model and compared its performance with linear regression (LR) and support vector machine (SVM). Shapley Additive exPlanations (SHAP) values were then used to assess variable importance and directions of influence. Geospatial analysis revealed elevated screening rates in the eastern and northern U.S., while central and Midwestern areas exhibited lower rates.

Results:

The RF model demonstrated superior performance compared to LR and SVM. SHAP values indicated that the percentage of the Black population, the number of mammography facilities within a 10-mile radius, and the percentage of the population with at least a bachelor's degree were the most influential variables, all positively associated with mammography screening rates.

Conclusions:

These findings underscore the significance of social factors and the accessibility of mammography services in explaining the variability of breast cancer screening in the U.S. emphasizing the need for targeted policy interventions in areas with relatively lower screening rates.


 Citation

Please cite as:

Hashtarkhani S, Zhou Y, Kumsa FA, White-Means S, Schwartz DL, Shaban-Nejad A

Analyzing Geospatial and Socioeconomic Disparities in Breast Cancer Screening Among Populations in the United States: Machine Learning Approach

JMIR Cancer 2025;11:e59882

DOI: 10.2196/59882

PMID: 39819978

PMCID: 11756836

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.