Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Mar 19, 2021
Date Accepted: Oct 4, 2021
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Energy Efficiency of Inference Algorithms for Medical Datasets: A Green AI study
ABSTRACT
Background:
Harnessing artificial intelligence (AI) in medical domain has raised considerable interest recently. An AI model must be energy-efficient if it has to be used for inference applications in medical domain. Different from other type of data in visual AI, data in medical domain are usually composed of features with strong signals. Numerous energy optimization techniques have been developed to relieve the burden on the hardware required to deploy a complex learning model. However, the energy efficiencies of different AI models used for medical applications have not yet been studied.
Objective:
To explore and compare the energy efficiencies of widely-used machine learning (ML) algorithms, including logistic regression (LR), k-nearest neighbors (kNN), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGB), and two different neural networks (NN) in the medical datasets.
Methods:
We applied the algorithms above to two distinct medical datasets, the mass spectrometry data of Staphylococcus aureus for predicting methicillin-resistance (“Mass spectrometry” dataset: 3338 cases; 268 features), and the urinalysis data for predicting Trichomonas vaginalis infection (“Urinalysis” dataset: 839,164 cases; 9 features). We compared the performance among these seven inference algorithms across accuracy, area under the receiver operating characteristic (AUROC), time consumption, and power consumption. The time and power consumptions were determined using the performance counter data from Intel Power Gadget 3.5.
Results:
Experimental results showed that the RF and XGB algorithms achieved the two highest AUROC scores with both datasets (84.7% and 83.9% with the “Mass spectrometry” dataset, respectively, and 91.1% and 91.4% with the “Urinalysis” dataset, respectively). In terms of time consumption, the XGB, 1-hidden-layer NN and LR algorithms exhibit the lowest time consumption with both datasets. RF as the referral baseline, XGB, 1-hidden-layer NN and LR achieved 45% reduction of inferencing time with the “Mass spectrometry” dataset, and 53-60% reduction with the “Urinalysis” dataset, respectively. In terms of energy efficiency, XGB, LR, SVM and RF consumed the least power. 5-hidden-layer NN as the referral baseline, XGB, LR, SVM and RF achieved 24-32% reduction of power consumption with the “Mass spectrometry” dataset, and 20-53% reduction with the “Urinalysis” dataset, respectively. Among all experiments, XGB achieved the best performance across accuracy, runtime, and energy efficiency.
Conclusions:
In current study, XGB attained a balanced performance across accuracy, runtime, and energy efficiency in the medical datasets. The research results indicate that the XGB would be an ideal algorithm for applying ML to real-world medical scenarios.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.