Loading…

A Simple Two-Sample Test in High Dimensions Based on L2-Norm

Testing the equality of two means is a fundamental inference problem. For high-dimensional data, the Hotelling’s T2-test either performs poorly or becomes inapplicable. Several modifications have been proposed to address this issue. However, most of them are based on asymptotic normality of the null...

Full description

Saved in:
Bibliographic Details
Published in:Journal of the American Statistical Association 2020-06, Vol.115 (530), p.1011-1027
Main Authors: Jin-Ting, Zhang, Guo, Jia, Zhou, Bu, Ming-Yen, Cheng
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Testing the equality of two means is a fundamental inference problem. For high-dimensional data, the Hotelling’s T2-test either performs poorly or becomes inapplicable. Several modifications have been proposed to address this issue. However, most of them are based on asymptotic normality of the null distributions of their test statistics which inevitably requires strong assumptions on the covariance. We study this problem thoroughly and propose an L2-norm based test that works under mild conditions and even when there are fewer observations than the dimension. Specially, to cope with general nonnormality of the null distribution we employ the Welch–Satterthwaite χ2-approximation. We derive a sharp upper bound on the approximation error and use it to justify that χ2-approximation is preferred to normal approximation. Simple ratio-consistent estimators for the parameters in the χ2-approximation are given. Importantly, our test can cope with singularity or near singularity of the covariance which is commonly seen in high dimensions and is the main cause of nonnormality. The power of the proposed test is also investigated. Extensive simulation studies and an application show that our test is at least comparable to and often outperforms several competitors in terms of size control, and the powers are comparable when their sizes are. Supplementary materials for this article are available online.
ISSN:0162-1459
1537-274X
DOI:10.1080/01621459.2019.1604366