Loading…

Machine Learning Classification Techniques for Breast Cancer Diagnosis

Breast cancer is one of the most widely spread disease and the second leading cause of cancer death among women. Breast cancer starts when malignant lumps which are cancerous begin to grow from the breast cells. Doctors may wrongly diagnose benign tumor (which is non-cancerous) as malignant tumor. T...

Full description

Saved in:
Bibliographic Details
Published in:IOP conference series. Materials Science and Engineering 2019-04, Vol.495 (1), p.12033
Main Authors: Omondiagbe, David A., Veeramani, Shanmugam, Sidhu, Amandeep S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Breast cancer is one of the most widely spread disease and the second leading cause of cancer death among women. Breast cancer starts when malignant lumps which are cancerous begin to grow from the breast cells. Doctors may wrongly diagnose benign tumor (which is non-cancerous) as malignant tumor. There is need for a computer aided detection (CAD) systems which uses machine learning approach to provide accurate diagnosis of breast cancer. These CAD systems can aid in detecting breast cancer at an early stage. When, breast cancer is detected early enough, the survival rate increases because better treatment can be provided. This paper aims at investigating Support Vector Machine (using radial basis kernel), Artificial Neural Networks and Naïve Bayes using the Wisconsin Diagnostic Breast Cancer (WDBC) Dataset. The focus of this paper is to integrate these machine learning techniques with feature selection/feature extraction methods and compare their performances to identify the most suitable approach. The goal is combining the advantages of dimensionality reduction and machine learning. This paper proposed a hybrid approach for breast cancer diagnosis by reducing the high dimensionality of features using linear discriminant analysis (LDA), and then applying the new reduced feature dataset to Support Vector Machine. The proposed approach obtained an accuracy of 98.82%, sensitivity of 98.41%, specificity of 99.07% and area under the receiver operating characteristic curve of 0.9994.
ISSN:1757-8981
1757-899X
1757-899X
DOI:10.1088/1757-899X/495/1/012033