Loading…

Markov blanket-based approach for learning multi-dimensional Bayesian network classifiers: An application to predict the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson’s Disease Questionnaire (PDQ-39)

[Display omitted] ► MB-MBC is a new approach for learning multi-dimensional Bayesian network classifiers. ► MB-MBC is a constraint-based approach based on building Markov blankets. ► MB-MBC is evaluated using synthetic and real-world Parkinson’s disease data sets. ► Experimental study shows promisin...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2012-12, Vol.45 (6), p.1175-1184
Main Authors: Borchani, Hanen, Bielza, Concha, Martı´nez-Martı´n, Pablo, Larrañaga, Pedro
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:[Display omitted] ► MB-MBC is a new approach for learning multi-dimensional Bayesian network classifiers. ► MB-MBC is a constraint-based approach based on building Markov blankets. ► MB-MBC is evaluated using synthetic and real-world Parkinson’s disease data sets. ► Experimental study shows promising results compared with state-of-the-art approaches. Multi-dimensional Bayesian network classifiers (MBCs) are probabilistic graphical models recently proposed to deal with multi-dimensional classification problems, where each instance in the data set has to be assigned to more than one class variable. In this paper, we propose a Markov blanket-based approach for learning MBCs from data. Basically, it consists of determining the Markov blanket around each class variable using the HITON algorithm, then specifying the directionality over the MBC subgraphs. Our approach is applied to the prediction problem of the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson’s Disease Questionnaire (PDQ-39) in order to estimate the health-related quality of life of Parkinson’s patients. Fivefold cross-validation experiments were carried out on randomly generated synthetic data sets, Yeast data set, as well as on a real-world Parkinson’s disease data set containing 488 patients. The experimental study, including comparison with additional Bayesian network-based approaches, back propagation for multi-label learning, multi-label k-nearest neighbor, multinomial logistic regression, ordinary least squares, and censored least absolute deviations, shows encouraging results in terms of predictive accuracy as well as the identification of dependence relationships among class and feature variables.
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2012.07.010