Loading…

Mixture analysis of multivariate categorical data with covariates and missing entries

Longitudinal or otherwise correlated categorical variables are typically related to some covariates and exhibit nonignorable correlations of the observed variables. A further complication often consists in missing entries. For analyzing such data, it is proposed to create an extra missing category a...

Full description

Saved in:
Bibliographic Details
Published in:Computational statistics & data analysis 2007-07, Vol.51 (11), p.5236-5246
Main Author: Formann, Anton K.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Longitudinal or otherwise correlated categorical variables are typically related to some covariates and exhibit nonignorable correlations of the observed variables. A further complication often consists in missing entries. For analyzing such data, it is proposed to create an extra missing category and to employ latent class analysis which, regarding missing data, can be shown to belong to the family of nonmissing at random models. By treating the complete and the incomplete cases jointly, it becomes possible to estimate the parameters of interest along with additional parameters characterizing the missing mechanism. Data from the Muscatine Coronary Risk Factor Study, where each child was classified obese or not obese at three occasions, serve as an illustrative example. Previous analyses resulted in significant interaction of age and sex for the complete data ( N = 460 ) , and in a linear increase in the logit of the rate of obesity over time for the incomplete data, with no effect of the covariate sex ( N = 1014 ) . Reanalyses employing latent class models do not support these findings. The finally accepted two-classes model for the complete data assumes a linear effect of age which is the same for boys and girls. The incomplete data were considered three-categorical (not obese, obese, missing) and resulted in a more complex model only in part supporting the linear age hypothesis.
ISSN:0167-9473
1872-7352
DOI:10.1016/j.csda.2006.08.020