Loading…
A general framework of online updating variable selection for generalized linear models with streaming datasets
In the era of big data, one of the important issues is how to recover the sets of true features when the data sets arrive sequentially. The paper presents a general framework for online updating variable selection and parameter estimation in generalized linear models with streaming datasets. This is...
Saved in:
Published in: | Journal of statistical computation and simulation 2023-02, Vol.93 (3), p.325-340 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | In the era of big data, one of the important issues is how to recover the sets of true features when the data sets arrive sequentially. The paper presents a general framework for online updating variable selection and parameter estimation in generalized linear models with streaming datasets. This is a type of online updating penalized likelihoods with differentiable or non-differentiable penalty functions. An online updating coordinate descent algorithm is proposed for solving the online updating optimization problem. Moreover, a tuning parameter selection is suggested in an online updating way. The selection and estimation consistencies and the oracle property are established, theoretically. Our methods are further examined and illustrated by various numerical examples from both simulation experiments and a real data analysis. |
---|---|
ISSN: | 0094-9655 1563-5163 |
DOI: | 10.1080/00949655.2022.2107207 |