Loading…

Beware of R2: simple, unambiguous assessment of the prediction accuracy of QSAR and QSPR models

The statistical metrics used to characterize the external predictivity of a model, i.e., how well it predicts the properties of an independent test set, have proliferated over the past decade. This paper clarifies some apparent confusion over the use of the coefficient of determination, R 2 , as a m...

Full description

Saved in:
Bibliographic Details
Published in:Journal of chemical information and modeling 2015-07, Vol.55 (7), p.1316-1322
Main Authors: Alexander, D. L. J., Tropsha, A., Winkler, David A.
Format: Article
Language:eng ; jpn
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The statistical metrics used to characterize the external predictivity of a model, i.e., how well it predicts the properties of an independent test set, have proliferated over the past decade. This paper clarifies some apparent confusion over the use of the coefficient of determination, R 2 , as a measure of model fit and predictive power in QSAR and QSPR modelling. R 2 (or R 2 ) has been used in various contexts in the literature in conjunction with training and test data, for both ordinary linear regression and regression through the origin as well as with linear and nonlinear regression models. We analyze the widely adopted model fit criteria suggested by Golbraikh and Tropsha 1 in a strict statistical manner. Shortcomings in these criteria are identified and a clearer and simpler alternative method to characterize model predictivity is provided. The intent is not to repeat the well-documented arguments for model validation using test data, but to guide the application of R 2 as a model fit statistic. Examples are used to illustrate both correct and incorrect use of R 2 . Reporting the root mean squared error or equivalent measures of dispersion, typically of more practical importance than R 2 , is also encouraged and important challenges in addressing the needs of different categories of users such as computational chemists, experimental scientists, and regulatory decision support specialists are outlined.
ISSN:1549-9596
1549-960X
DOI:10.1021/acs.jcim.5b00206