Loading…
On the Choice of a Model to Fit Data from an Exponential Family
Let X1, ⋯, Xnbe iid observations coming from an exponential family. The problem of interest is this: Given a finite number of models mj(smoothly curved manifolds in Rk), choose the best model to fit the observations, with some penalty for choosing models with dimensions which are too large. A result...
Saved in:
Published in: | The Annals of statistics 1988-03, Vol.16 (1), p.342-355 |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Let X1, ⋯, Xnbe iid observations coming from an exponential family. The problem of interest is this: Given a finite number of models mj(smoothly curved manifolds in Rk), choose the best model to fit the observations, with some penalty for choosing models with dimensions which are too large. A result of Schwarz is made more specific and is extended to the case where the models are curved manifolds. If S(Y, n, j) is--up to a constant C(n) independent of the model--the log of the posterior probability of the jth model, where the sample mean Yn= (1/n)∑n
i=1Xihas been replaced by Y, Schwarz suggested an asymptotic expansion of S(Y, n, j) whose leading terms are$\gamma(Y, n, j) = n \sup_{\psi \in m_j \cap \Theta}(Y\psi - b(\psi)) - \frac{1}{2}k_j \log n$, in the case where the models are affine subspaces of Rk. We establish a similar asymptotic expansion, including the next term, with uniform bounds for Y in a compact neighborhood of ∇ b(θ), where θ is the true value of the parameter. We suggest a criterion for the choice of the best model that consists of maximizing the three leading terms in the expansion S(Y, n, j). We show that the criterion gives the correct model with probabilities Pn
θ→ 1 as n → + ∞. |
---|---|
ISSN: | 0090-5364 2168-8966 |
DOI: | 10.1214/aos/1176350709 |