Loading…

A unified approach for outliers and influential data detection: The value of information in retrospect

Identifying influential and outlying data is important as it would guide the effective collection of future data and the proper use of existing information. We develop a unified approach for outlier detection and influence analysis. Our proposed method is grounded in the intuitive value of informati...

Full description

Saved in:
Bibliographic Details
Published in:Stat (International Statistical Institute) 2022-12, Vol.11 (1), p.n/a
Main Authors: Parsons, Jacob, Bao, Le
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Identifying influential and outlying data is important as it would guide the effective collection of future data and the proper use of existing information. We develop a unified approach for outlier detection and influence analysis. Our proposed method is grounded in the intuitive value of information concepts and has a distinct advantage in interpretability and flexibility when compared to existing methods: It decomposes the data influence into the leverage effect (expected to be influential) and the outlying effect (surprisingly more influential than being expected); and it applies to all decision problems such as estimation, prediction and hypothesis testing. We study the theoretical properties of three values of information quantities, establish the relationship between the proposed measures and classic measures in the linear regression setting and provide real data analysis examples of how to apply the new value of information approach in the cases of linear regression, generalized linear mixed models and hypothesis testing.
ISSN:2049-1573
2049-1573
DOI:10.1002/sta4.442