Loading…

Overfitting revisited: an information-theoretic approach to simplifying discrimination trees

This paper describes a method of simplifying inductively generated discrimination trees using a measure of tree quality based on the principle of information economy, which takes into account both the size of the tree and the size of the outcome data after (notional) encoding by that tree. Results o...

Full description

Saved in:
Bibliographic Details
Published in:Journal of experimental & theoretical artificial intelligence 1994-07, Vol.6 (3), p.289-302
Main Authors: FORSYTH, RICHARD S., CLARKE, DAVID D., WRIGHT, RICHARD L.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper describes a method of simplifying inductively generated discrimination trees using a measure of tree quality based on the principle of information economy, which takes into account both the size of the tree and the size of the outcome data after (notional) encoding by that tree. Results of testing this method on a selection of data sets show that it has some practical advantages over previously used techniques for tree-pruning. Some of the theoretical implications of the present method are also discussed.
ISSN:0952-813X
1362-3079
DOI:10.1080/09528139408953790