Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Mar 19, 2021
Date Accepted: Oct 4, 2021

The final, peer-reviewed published version of this preprint can be found here:

Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study

Yu JR, Chen CH, Huang TW, Lu JJ, Chung CR, Lin TW, Wu MH, Tseng YJ, Wang HY

Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study

J Med Internet Res 2022;24(1):e28036

DOI: 10.2196/28036

PMID: 35076405

PMCID: 8826151

Energy Efficiency of Inference Algorithms for Clinical Laboratory Datasets: A Green AI Study

  • Jia-Ruei Yu; 
  • Chun-Hsien Chen; 
  • Tsung-Wei Huang; 
  • Jang-Jih Lu; 
  • Chia-Ru Chung; 
  • Ting-Wei Lin; 
  • Min-Hsien Wu; 
  • Yi-Ju Tseng; 
  • Hsin-Yao Wang

ABSTRACT

Background:

Harnessing artificial intelligence (AI) in medical domain has raised considerable interest recently. An AI model must be energy-efficient if it has to be used for inference applications in medical domain. Different from other type of data in visual AI, data in medical domain are usually composed of features with strong signals. Numerous energy optimization techniques have been developed to relieve the burden on the hardware required to deploy a complex learning model. However, the energy efficiencies of different AI models used for medical applications have not yet been studied.

Objective:

To explore and compare the energy efficiencies of widely-used machine learning (ML) algorithms, including logistic regression (LR), k-nearest neighbors (kNN), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGB), and two different neural networks (NN) in the medical datasets.

Methods:

We applied the algorithms above to two distinct medical datasets, the mass spectrometry data of Staphylococcus aureus for predicting methicillin-resistance (“Mass spectrometry” dataset: 3338 cases; 268 features), and the urinalysis data for predicting Trichomonas vaginalis infection (“Urinalysis” dataset: 839,164 cases; 9 features). We compared the performance among these seven inference algorithms across accuracy, area under the receiver operating characteristic (AUROC), time consumption, and power consumption. The time and power consumptions were determined using the performance counter data from Intel Power Gadget 3.5.

Results:

Experimental results showed that the RF and XGB algorithms achieved the two highest AUROC scores with both datasets (84.7% and 83.9% with the “Mass spectrometry” dataset, respectively, and 91.1% and 91.4% with the “Urinalysis” dataset, respectively). In terms of time consumption, the XGB, 1-hidden-layer NN and LR algorithms exhibit the lowest time consumption with both datasets. RF as the referral baseline, XGB, 1-hidden-layer NN and LR achieved 45% reduction of inferencing time with the “Mass spectrometry” dataset, and 53-60% reduction with the “Urinalysis” dataset, respectively. In terms of energy efficiency, XGB, LR, SVM and RF consumed the least power. 5-hidden-layer NN as the referral baseline, XGB, LR, SVM and RF achieved 24-32% reduction of power consumption with the “Mass spectrometry” dataset, and 20-53% reduction with the “Urinalysis” dataset, respectively. Among all experiments, XGB achieved the best performance across accuracy, runtime, and energy efficiency.

Conclusions:

In current study, XGB attained a balanced performance across accuracy, runtime, and energy efficiency in the medical datasets. The research results indicate that the XGB would be an ideal algorithm for applying ML to real-world medical scenarios.


 Citation

Please cite as:

Yu JR, Chen CH, Huang TW, Lu JJ, Chung CR, Lin TW, Wu MH, Tseng YJ, Wang HY

Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study

J Med Internet Res 2022;24(1):e28036

DOI: 10.2196/28036

PMID: 35076405

PMCID: 8826151

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.