Loading…
A Stahel–Donoho estimator based on huberized outlyingness
The Stahel–Donoho estimator is defined as a weighted mean and covariance, where the weight of each observation depends on a measure of its outlyingness. In high dimensions, it can easily happen that a number of outlying measurements are present in such a way that the majority of observations are con...
Saved in:
Published in: | Computational statistics & data analysis 2012-03, Vol.56 (3), p.531-542 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The Stahel–Donoho estimator is defined as a weighted mean and covariance, where the weight of each observation depends on a measure of its outlyingness. In high dimensions, it can easily happen that a number of outlying measurements are present in such a way that the majority of observations are contaminated in at least one of their components. In these situations, the Stahel–Donoho estimator has difficulties in identifying the actual outlyingness of the contaminated observations. An adaptation of the Stahel–Donoho estimator is presented in which the data are huberized before the outlyingness is computed. It is shown that the huberized outlyingness better reflects the actual outlyingness of each observation towards the non-contaminated observations. Therefore, the resulting adapted Stahel–Donoho estimator can better withstand large numbers of outliers. It is demonstrated that the Stahel–Donoho estimator based on huberized outlyingness works especially well when the data are heavily contaminated. |
---|---|
ISSN: | 0167-9473 1872-7352 |
DOI: | 10.1016/j.csda.2011.08.014 |