Loading…
Penalized Generalized Estimating Equations for High‐Dimensional Longitudinal Data Analysis
We consider the penalized generalized estimating equations (GEEs) for analyzing longitudinal data with high‐dimensional covariates, which often arise in microarray experiments and large‐scale health studies. Existing high‐dimensional regression procedures often assume independent data and rely on th...
Saved in:
Published in: | Biometrics 2012-06, Vol.68 (2), p.353-360 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | We consider the penalized generalized estimating equations (GEEs) for analyzing longitudinal data with high‐dimensional covariates, which often arise in microarray experiments and large‐scale health studies. Existing high‐dimensional regression procedures often assume independent data and rely on the likelihood function. Construction of a feasible joint likelihood function for high‐dimensional longitudinal data is challenging, particularly for correlated discrete outcome data. The penalized GEE procedure only requires specifying the first two marginal moments and a working correlation structure. We establish the asymptotic theory in a high‐dimensional framework where the number of covariates pn increases as the number of clusters n increases, and pn can reach the same order as n. One important feature of the new procedure is that the consistency of model selection holds even if the working correlation structure is misspecified. We evaluate the performance of the proposed method using Monte Carlo simulations and demonstrate its application using a yeast cell‐cycle gene expression data set. |
---|---|
ISSN: | 0006-341X 1541-0420 |
DOI: | 10.1111/j.1541-0420.2011.01678.x |