Loading…

Robust Estimation and Outlier Detection for Overdispersed Multinomial Models of Count Data

We develop a robust estimator-the hyperbolic tangent (tanh) estimator-for overdispersed multinomial regression models of count data. The tanh estimator provides accurate estimates and reliable inferences even when the specified model is not good for as much as half of the data. Seriously ill-fitted...

Full description

Saved in:
Bibliographic Details
Published in:American journal of political science 2004-04, Vol.48 (2), p.392-411
Main Authors: Mebane, Walter R., Sekhon, Jasjeet S.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We develop a robust estimator-the hyperbolic tangent (tanh) estimator-for overdispersed multinomial regression models of count data. The tanh estimator provides accurate estimates and reliable inferences even when the specified model is not good for as much as half of the data. Seriously ill-fitted counts-outliers-are identified as part of the estimation. A Monte Carlo sampling experiment shows that the tanh estimator produces good results at practical sample sizes even when ten percent of the data are generated by a significantly different process. The experiment shows that, with contaminated data, estimation fails using four other estimators: the nonrobust maximum likelihood estimator, the additive logistic model and two SUR models. Using the tanh estimator to analyze data from Florida for the 2000 presidential election matches well-known features of the election that the other four estimators fail to capture. In an analysis of data from the 1993 Polish parliamentary election, the tanh estimator gives sharper inferences than does a previously proposed heteroskedastic SUR model.
ISSN:0092-5853
1540-5907
DOI:10.1111/j.0092-5853.2004.00077.x