Loading…

Automatic breast cancer diagnosis based on hybrid dimensionality reduction technique and ensemble classification

Introduction Feature selection in the face of high-dimensional data can reduce overfitting and learning time, and at the same time improve the accuracy and efficiency of the system. Since there are many irrelevant and redundant features in breast cancer diagnosis, removing such features leads to mor...

Full description

Saved in:
Bibliographic Details
Published in:Journal of cancer research and clinical oncology 2023-08, Vol.149 (10), p.7609-7627
Main Authors: Li, Xingyuan, Chen, Xi, Rezaeipanah, Amin
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Introduction Feature selection in the face of high-dimensional data can reduce overfitting and learning time, and at the same time improve the accuracy and efficiency of the system. Since there are many irrelevant and redundant features in breast cancer diagnosis, removing such features leads to more accurate prediction and reduced decision time when dealing with large-scale data. Meanwhile, ensemble classifiers are powerful techniques to improve the prediction performance of classification models, where several individual classifier models are combined to achieve higher accuracy. Methods In this paper, an ensemble classifier algorithm based on multilayer perceptron neural network is proposed for the classification task, in which the parameters (e.g., number of hidden layers, number of neurons in each hidden layer, and weights of links) are adjusted based on an evolutionary approach. Meanwhile, this paper uses a hybrid dimensionality reduction technique based on principal component analysis and information gain to address this problem. Results The effectiveness of the proposed algorithm was evaluated based on the Wisconsin breast cancer database. In particular, the proposed algorithm provides an average of 17% better accuracy compared to the best results obtained from the existing state-of-the-art methods. Conclusion Experimental results show that the proposed algorithm can be used as an intelligent medical assistant system for breast cancer diagnosis.
ISSN:0171-5216
1432-1335
DOI:10.1007/s00432-023-04699-x