Loading…

Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification

In this paper, we seek to enhance the identification performance of Gaussian Mixture Model (GMM)-based speaker identification systems in the presence of a limited amount of training data & a relatively large number of speakers. The performance is characterized by the identification accuracy, the...

Full description

Saved in:
Bibliographic Details
Published in:International journal of speech technology 2005-01, Vol.8 (1), p.93-103
Main Authors: El-Gamal, M A, El-Yazeed, M F Abu, El Ayadi, M M H
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this paper, we seek to enhance the identification performance of Gaussian Mixture Model (GMM)-based speaker identification systems in the presence of a limited amount of training data & a relatively large number of speakers. The performance is characterized by the identification accuracy, the identification time, & the model complexity. A new model order selection technique based on the Goodness of Fit (GOF) statistical test is proposed in order to increase the identification accuracy. This technique has shown to outperform other well known model order selection techniques like the Minimum Description Length (MDL) & the Akaike Information Criterion (AIC) in terms of the identification accuracy & the robustness against telephone channel degradation effects. In addition, the identification time is decreased by adapting the Linear Discriminative Analysis (LDA) feature extraction technique to fit our basic assumption of asymmetric multimodal distribution of the training data of each speaker. This modification results in a large decrease in the identification time with a little effect on the identification accuracy. 2 Tables, 4 Figures, 18 References. Adapted from the source document
ISSN:1381-2416
1572-8110
DOI:10.1007/s10772-005-4764-8