On the Determination of Optimal Model Order for GMM-Based Text-Independent Speaker Identification

Bibliographic Details
Published in: EURASIP journal on advances in signal processing, 2004-07, Vol. 2004 (8), p. 1078-1087
Main Authors: M. F. Abu El-Yazeed, M. A. El Gamal, M. M. H. El Ayadi
Format: Article
Language: English
Description
Summary: Gaussian mixture models (GMMs) have recently been employed to provide a robust technique for speaker identification. Determining the appropriate number of Gaussian components needed in a model to represent a speaker adequately is a crucial but difficult problem. This number is in fact speaker dependent, so assuming a fixed number of Gaussian components for all speakers is not justified. In this paper, we develop a procedure for roughly estimating the maximum possible model order, above which the estimation of model parameters becomes unreliable. In addition, a theoretical goodness-of-fit (GOF) measure is derived and used to estimate the number of Gaussian components needed to characterize different speakers. The estimation is carried out by exploiting the distribution of the training data for each speaker. Experimental results indicate that the proposed technique gives results comparable to those of other well-known model selection criteria such as the minimum description length (MDL) and the Akaike information criterion (AIC).
ISSN: 1687-6172, 1687-6180
DOI: 10.1155/S1687617204312205
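
For readers unfamiliar with the two baseline criteria named in the summary, the following is a minimal sketch of how AIC and MDL are commonly used to choose a GMM order. It does not reproduce the paper's GOF measure. It uses the textbook forms AIC = 2k - 2 ln L and MDL = -ln L + (k/2) ln N, and it assumes diagonal covariance matrices when counting free parameters. The function names and the log-likelihood values in the usage note are hypothetical and chosen only for illustration.

```python
import math


def gmm_free_params(n_components, dim):
    """Free parameters of a GMM, ASSUMING diagonal covariances.

    Each component has `dim` means and `dim` variances; the mixture
    weights contribute (n_components - 1) because they sum to one.
    """
    return n_components * 2 * dim + (n_components - 1)


def aic(log_likelihood, n_params):
    """Akaike information criterion (lower is better)."""
    return 2.0 * n_params - 2.0 * log_likelihood


def mdl(log_likelihood, n_params, n_samples):
    """Two-part minimum description length (lower is better)."""
    return -log_likelihood + 0.5 * n_params * math.log(n_samples)


def select_order(log_likelihoods, dim, n_samples, criterion="mdl"):
    """Pick the model order with the lowest criterion value.

    `log_likelihoods` maps each candidate order to the total training
    log-likelihood of a GMM of that order, fitted elsewhere (e.g. by EM).
    """
    scores = {}
    for order, ll in log_likelihoods.items():
        k = gmm_free_params(order, dim)
        if criterion == "aic":
            scores[order] = aic(ll, k)
        else:
            scores[order] = mdl(ll, k, n_samples)
    best = min(scores, key=scores.get)
    return best, scores
```

As a usage example with made-up numbers: given total log-likelihoods of -500, -400, and -395 for orders 1, 2, and 4 on 100 two-dimensional training vectors, `select_order({1: -500.0, 2: -400.0, 4: -395.0}, dim=2, n_samples=100)` favors order 2 under both criteria, because the small likelihood gain from order 4 does not offset its larger parameter count.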