Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Diabetes

Date Submitted: Jun 16, 2025
Date Accepted: Dec 17, 2025

The final, peer-reviewed published version of this preprint can be found here:

Privacy-Preserving Collaborative Diabetes Prediction in Heterogeneous Health Care Systems: Algorithm Development and Validation of a Secure Federated Ensemble Framework

Hasan MR, Li J

Privacy-Preserving Collaborative Diabetes Prediction in Heterogeneous Health Care Systems: Algorithm Development and Validation of a Secure Federated Ensemble Framework

JMIR Diabetes 2026;11:e79166

DOI: 10.2196/79166

PMID: 41587070

PMCID: 12834198

From Clinics to Wearables: Secure and Efficient Diabetes Prediction with Federated Ensemble Learning Across Heterogeneous Healthcare Systems

  • Md Rakibul Hasan; 
  • Juan Li

ABSTRACT

Background:

Diabetes prediction requires accurate, privacy-preserving, and scalable solutions. Traditional machine learning models rely on centralized data, posing risks to data privacy and regulatory compliance. Moreover, healthcare settings are highly heterogeneous, with diverse participants, hospitals, clinics, and wearables, producing non-IID data and operating under varied computational constraints. Learning in isolation at individual institutions limits model generalizability and effectiveness. Collaborative federated learning enables institutions to jointly train models without sharing raw data, but current approaches often struggle with heterogeneity, security threats, and system coordination.

Objective:

This study aims to develop a secure, scalable, and privacy-preserving framework for diabetes prediction by integrating federated learning with ensemble modeling, blockchain-based access control, and knowledge distillation. The framework is designed to handle data heterogeneity, non-IID distributions, and varying computational capacities across diverse healthcare participants, while simultaneously enhancing data privacy, security, and trust.

Methods:

We propose a Federated Ensemble Learning framework, FedEnTrust, that enables decentralized healthcare participants to collaboratively train models without sharing raw data. Each participant shares soft label outputs, which are distilled and aggregated through adaptive weighted voting to form a global consensus. The framework supports heterogeneous participants by assigning model architectures based on local computational capacity. To ensure secure and transparent coordination, a blockchain-enabled smart contract governs participant registration, role assignment, and model submission with strict role-based access control. We evaluated the system on the PIMA Indians Diabetes Dataset, measuring prediction accuracy, communication efficiency, and blockchain performance.

Results:

The FedEnTrust framework achieved 84.2% accuracy, with precision, recall, and F1-score of 84.6%, 88.6%, and 86.4%, respectively, outperforming existing decentralized models and nearing centralized deep learning benchmarks. The blockchain-based smart contract ensured 100% success for authorized transactions and rejected all unauthorized attempts, including malicious submissions. Average blockchain latency was 210 milliseconds, with a gas cost of ~107,940 units, enabling secure, real-time interaction. Throughout, patient privacy was preserved by exchanging only model metadata, not raw data.

Conclusions:

FedEnTrust offers a deployable, privacy-preserving solution for decentralized healthcare prediction by integrating federated learning, ensemble modeling, blockchain-based access control, and knowledge distillation. It balances accuracy, scalability, and ethical data use while enhancing security and trust. This work demonstrates that secure federated ensemble systems can serve as practical alternatives to centralized AI models in real-world healthcare applications.


 Citation

Please cite as:

Hasan MR, Li J

Privacy-Preserving Collaborative Diabetes Prediction in Heterogeneous Health Care Systems: Algorithm Development and Validation of a Secure Federated Ensemble Framework

JMIR Diabetes 2026;11:e79166

DOI: 10.2196/79166

PMID: 41587070

PMCID: 12834198

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.