Loading…

Tree-Based Methods as an Alternative to Logistic Regression in Revealing Risk Factors of Crib-Biting in Horses

Determining the risk factors might help in designing prevention of crib-biting. Logistic regression is a commonly used statistical method for finding risk factors, but tree-based methods are also getting more popular. An important difference between these two statistical approaches is that logistic...

Full description

Saved in:
Bibliographic Details
Published in:Journal of equine veterinary science 2010, Vol.30 (1), p.21-26
Main Authors: Nagy, Krisztina, Reiczigel, Jenő, Harnos, Andrea, Schrott, Anikó, Kabai, Péter
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Determining the risk factors might help in designing prevention of crib-biting. Logistic regression is a commonly used statistical method for finding risk factors, but tree-based methods are also getting more popular. An important difference between these two statistical approaches is that logistic regression makes a number of assumptions about the underlying data, whereas tree-based methods do not. Another difference is that logistic regression can be used to derive odds ratios for the significant risk factors, whereas tree-based methods create a tree where the ramifications represent the risk factors. The probability of occurrence is assigned to each end of branch in the tree. Data of horses used for noncompetition purposes were analyzed with three statistical approaches: logistic regression, classification tree, and conditional inference tree methods. By this, we compared the advantages and disadvantages of these statistical methods. No difference was found between the two tree-based methods regarding the structure and prediction accuracy of the trees. Compared to them, logistic regression revealed fewer risk factors, and also the number of the stereotypic horses classified correctly by the model was less. The representation of the tree-based methods is closer to medical reasoning and also high-order interaction of the risk-factors can easily be visualized. Our results suggest that tree-based methods can be a new alternative in revealing risk factors, even if used alone or together with logistic regression.
ISSN:0737-0806
1542-7412
DOI:10.1016/j.jevs.2009.11.005