Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Aug 22, 2017
Date Accepted: Jan 10, 2018
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation

Kim M, Song Y, Wang S, Xia Y, Jiang X

Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation

JMIR Med Inform 2018;6(2):e19

DOI: 10.2196/medinform.8805

PMID: 29666041

PMCID: 5930176

Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation

  • Miran Kim; 
  • Yongsoo Song; 
  • Shuang Wang; 
  • Yuhou Xia; 
  • Xiaoqian Jiang

ABSTRACT

Background:

Learning a model without accessing raw data has been an intriguing idea to security and machine learning researchers for years. In an ideal setting, we want to encrypt sensitive data to store them on a commercial cloud and run certain analyses without ever decrypting the data to preserve privacy. Homomorphic encryption technique is a promising candidate for secure data outsourcing, but it is a very challenging task to support real-world machine learning tasks. Existing frameworks can only handle simplified cases with low-degree polynomials such as linear means classifier and linear discriminative analysis.

Objective:

The goal of this study is to provide a practical support to the mainstream learning models (eg, logistic regression).

Methods:

We adapted a novel homomorphic encryption scheme optimized for real numbers computation. We devised (1) the least squares approximation of the logistic function for accuracy and efficiency (ie, reduce computation cost) and (2) new packing and parallelization techniques.

Results:

Using real-world datasets, we evaluated the performance of our model and demonstrated its feasibility in speed and memory consumption. For example, it took approximately 116 minutes to obtain the training model from the homomorphically encrypted Edinburgh dataset. In addition, it gives fairly accurate predictions on the testing dataset.

Conclusions:

We present the first homomorphically encrypted logistic regression outsourcing model based on the critical observation that the precision loss of classification models is sufficiently small so that the decision plan stays still.


 Citation

Please cite as:

Kim M, Song Y, Wang S, Xia Y, Jiang X

Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation

JMIR Med Inform 2018;6(2):e19

DOI: 10.2196/medinform.8805

PMID: 29666041

PMCID: 5930176

Per the author's request the PDF is not available.