Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Apr 23, 2019
Date Accepted: Jun 12, 2019
A Review on Breast Cancer Detection and Diagnosis using Mammographic Data
ABSTRACT
Background:
Machine learning (ML) has become a vital part of medical imaging research. ML methods have evolved over the years from manual seeded inputs to automatic initialization. The advancements in the ML field have led to more intelligent and self-reliant computer-aided diagnosis (CAD) systems as the learning ability of ML methods has been constantly improving. More and more automated methods are appearing with powerful feature learning and representations. Recent advancements of ML with deeper and extensive representation approach commonly known as deep learning (DL) approaches have made a very significant impact on improving the diagnostics capabilities of the CAD systems.
Objective:
This review aims to survey both traditional ML and DL literature in particular application for breast cancer diagnosis. The review also provides a brief insight into some well-known DL networks
Methods:
In this paper, we present an overview of ML and DL techniques with particular application for breast cancer. Specifically, we search the PubMed, Google Scholar, Medline, ScienceDirect,Springer and Web of Science databases and retrieve the studies in DL for the past five years that have used multi-view mammogram datasets.
Results:
The analysis of traditional machine learning reveals the limited usage of the methods, whereas the DL methods have great potential for implementation in clinical analysis and improve the diagnostic capability of existing CAD systems.
Conclusions:
From the literature, it can be found that heterogeneous breast densities make the masses more challenging to detect and classify as compared to calcifications. The traditional ML methods present confined approaches either limited to particular density type or datasets. While the DL methods show promising improvements in breast cancer diagnosis. However, there are still issues of data scarcity and computational cost which has been overcome to a significant extent by applying data augmentation and improved computational power of DL algorithms.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.