JMIR Preprints #18418: Can the Use of Attention Mechanisms be Assured in Clinical Research?: Evaluation of Current Design Approaches of Attention Mechanisms in Deep Learning Algorithms

Can the Use of Attention Mechanisms be Assured in Clinical Research?: Evaluation of Current Design Approaches of Attention Mechanisms in Deep Learning Algorithms

Junetae Kim;
Sangwon Lee;
Eugene Hwang;
Kwang Sun Ryu;
Jae Wook Lee;
Yul Hwangbo;
Kui Son Choi;
Hyo Soung Cha

ABSTRACT

Background:

Despite their excellent prediction performance, deep learning algorithms have had limited value in clinical practice because they are not interpretable. To overcome this limitation, an explanatory modeling method called the attention mechanism has been introduced to clinical research. However, clinical and informatics researchers have been given little accessible guidance on, or precautions for, using this attractive method. Furthermore, the predictive and interpretive performance of this method when applied to health data has rarely been discussed.

Objective:

The purpose of this study is to provide clinical researchers with the basic concepts and design approaches of attention mechanisms. In addition, the study aims to evaluate current design approaches of attention mechanisms in terms of prediction and interpretability performance.

Methods:

First, the basic concepts of attention mechanisms and several key considerations for their use are presented. Second, four approaches to attention mechanisms are introduced according to a two-dimensional framework based on degree of freedom and uncertainty awareness. Third, (1) prediction performance, (2) probability reliability, (3) concentration of variable importance, (4) consistency of attention results, and (5) generalizability of attention results to conventional statistics are assessed in a diabetic classification modeling setting. Fourth, the performance of the four attention design approaches is discussed.
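
As a rough illustration of the basic concept behind an attention mechanism applied to tabular variables, the following minimal sketch computes softmax attention weights per sample and averages them into a per-variable importance score. It is not the authors' implementation: the function names (`softmax`, `feature_attention`), the single linear scoring layer, the random untrained parameters, and the synthetic data are all assumptions made purely for illustration.

```python
import numpy as np

def softmax(z):
    # Subtract the row-wise maximum for numerical stability.
    z = z - np.max(z, axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def feature_attention(x, W, b):
    """Hypothetical attention layer over input variables.

    x: (n_samples, n_features) input matrix
    W: (n_features, n_features) scoring weights (assumed, untrained here)
    b: (n_features,) scoring bias
    """
    scores = x @ W + b          # one score per variable, per sample
    weights = softmax(scores)   # non-negative, each row sums to 1
    attended = weights * x      # inputs re-weighted by attention
    return weights, attended

# Synthetic data and random parameters, for illustration only.
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 4))
W = rng.normal(size=(4, 4))
b = np.zeros(4)

weights, attended = feature_attention(x, W, b)

# Averaging attention weights over samples gives one score per variable,
# which is how attention is commonly read as variable importance.
importance = weights.mean(axis=0)
print(importance)
```

In a trained model, `W` and `b` would be learned jointly with the classifier, and the averaged weights would be the quantity inspected for concentration and consistency.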

Results:

Prediction performance was very high for all models. Probability reliability was high in models with a high degree of freedom. Variable importance was concentrated in several variables when uncertainty awareness was not considered. Consistency of attention results was high when uncertainty awareness was considered. The generalizability of attention results to conventional statistics was poor regardless of the modeling approach.

Conclusions:

The attention mechanism is an attractive technique that could be very promising in the future. However, naive attention implementations may lead to poor results when used to determine variable importance. Therefore, more robust theoretical studies of attention mechanisms should be encouraged.

Citation

Please cite as:

Kim J, Lee S, Hwang E, Ryu KS, Lee JW, Hwangbo Y, Choi KS, Cha HS

Limitations of Deep Learning Attention Mechanisms in Clinical Research: Empirical Case Study Based on the Korean Diabetic Disease Setting

J Med Internet Res 2020;22(12):e18418

DOI: 10.2196/18418

PMID: 33325832

PMCID: 7773508

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). The authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage the authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Feb 25, 2020

Date Accepted: Nov 11, 2020
