A unified approach for outliers and influential data detection: The value of information in retrospect
Published in: | Stat (International Statistical Institute) 2022-12, Vol.11 (1), p.n/a |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Summary: | Identifying influential and outlying data is important because it guides the effective collection of future data and the proper use of existing information. We develop a unified approach for outlier detection and influence analysis. Our proposed method is grounded in intuitive value-of-information concepts and has distinct advantages in interpretability and flexibility over existing methods: it decomposes the influence of data into a leverage effect (the influence that is expected) and an outlying effect (influence beyond what is expected), and it applies to all decision problems, such as estimation, prediction and hypothesis testing. We study the theoretical properties of three value-of-information quantities, establish the relationship between the proposed measures and classic measures in the linear regression setting, and provide real-data examples of how to apply the new value-of-information approach to linear regression, generalized linear mixed models and hypothesis testing. |
---|---|
ISSN: | 2049-1573 |
DOI: | 10.1002/sta4.442 |