Loading…

Random forest swarm optimization-based for heart diseases diagnosis

[Display omitted] •Through combining the multi-objective particle swarm optimization and Random forest, a new approach is proposed to predict the heart disease.•The main goal is to produce diverse and accurate classifiers and determine the (near) optimal number of classifiers.•The results indicate t...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2021-03, Vol.115, p.103690-103690, Article 103690
Main Authors: Asadi, Shahrokh, Roshan, SeyedEhsan, Kattan, Michael W.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:[Display omitted] •Through combining the multi-objective particle swarm optimization and Random forest, a new approach is proposed to predict the heart disease.•The main goal is to produce diverse and accurate classifiers and determine the (near) optimal number of classifiers.•The results indicate that the proposed algorithm outperforms the other techniques in terms of accuracy and statistical tests. Heart disease has been one of the leading causes of death worldwide in recent years. Among diagnostic methods for heart disease, angiography is one of the most common methods, but it is costly and has side effects. Given the difficulty of heart disease prediction, data mining can play an important role in predicting heart disease accurately. In this paper, by combining the multi-objective particle swarm optimization (MOPSO) and Random Forest, a new approach is proposed to predict heart disease. The main goal is to produce diverse and accurate decision trees and determine the (near) optimal number of them simultaneously. In this method, an evolutionary multi-objective approach is used instead of employing a commonly used approach, i.e., bootstrap, feature selection in the Random Forest, and random number selection of training sets. By doing so, different training sets with different samples and features for training each tree are generated. Also, the obtained solutions in Pareto-optimal fronts determine the required number of training sets to build the random forest. By doing so, the random forest's performance can be enhanced, and consequently, the prediction accuracy will be improved. The proposed method's effectiveness is investigated by comparing its performance over six heart datasets with individual and ensemble classifiers. The results suggest that the proposed method with the (near) optimal number of classifiers outperforms the random forest algorithm with different classifiers.
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2021.103690