Loading…

Check your outliers! An introduction to identifying statistical outliers in R with easystats

Beyond the challenge of keeping up to date with current best practices regarding the diagnosis and treatment of outliers, an additional difficulty arises concerning the mathematical implementation of the recommended methods. Here, we provide an overview of current recommendations and best practices...

Full description

Saved in:
Bibliographic Details
Published in:Behavior research methods 2024, Vol.56 (4), p.4162-4172
Main Authors: Thériault, Rémi, Ben-Shachar, Mattan S., Patil, Indrajeet, Lüdecke, Daniel, Wiernik, Brenton M., Makowski, Dominique
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Beyond the challenge of keeping up to date with current best practices regarding the diagnosis and treatment of outliers, an additional difficulty arises concerning the mathematical implementation of the recommended methods. Here, we provide an overview of current recommendations and best practices and demonstrate how they can easily and conveniently be implemented in the R statistical computing software, using the {performance} package of the easystats ecosystem. We cover univariate, multivariate, and model-based statistical outlier detection methods, their recommended threshold, standard output, and plotting methods. We conclude by reviewing the different theoretical types of outliers, whether to exclude or winsorize them, and the importance of transparency. A preprint of this paper is available at: 10.31234/osf.io/bu6nt.
ISSN:1554-3528
1554-3528
DOI:10.3758/s13428-024-02356-w