Loading…

Validation Strategies for Multiple Regression Analysis: Using the Coefficient of Determination

Multiple regression equations designed to explain or predict should be validated. This tutorial shows how recalculation of the coefficient of determination on hold-out sample data or new sample data can be used to improve regression equations and to test them for validity. The Herzberg equation is u...

Full description

Saved in:
Bibliographic Details
Published in:Interfaces (Providence) 1991-11, Vol.21 (6), p.106-120
Main Author: Sobol, Marion G
Format: Article
Language:English
Subjects:
Citations: Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Multiple regression equations designed to explain or predict should be validated. This tutorial shows how recalculation of the coefficient of determination on hold-out sample data or new sample data can be used to improve regression equations and to test them for validity. The Herzberg equation is used as a criterion for acceptable shrinkage when the coefficient of determination is calculated on new data. Nevertheless, validation is an art rather than a science because elimination of unstable variables as well as different types of data splitting, use of new sample data, and adjustments for external differences when test samples are used from different time periods can lead to different decisions on whether the equations have been validated. Various strategies can be used to find effective validation techniques.
ISSN:0092-2102
2644-0865
1526-551X
2644-0873
DOI:10.1287/inte.21.6.106