Hybrid approach for speaker recognition based on formant and pitch extraction

Bibliographic Details
Main Authors: Boujnah, Sana; Ferjaoui, Radhia; Khalifa, Anouar Ben
Format: Conference Proceeding
Language: English
Description
Summary: The human voice is an ideal data source for identifying people in many applications. With the growing need for security in public places, voice biometrics may be a good solution because voice recordings are easy to capture. This paper gives a brief overview of existing speaker recognition approaches and then presents a novel approach for recognizing speakers in degraded smart-home conditions. The proposed approach comprises a pre-processing phase, a feature extraction phase, and a classification phase. The feature extraction phase consists of formant extraction, which obtains the spectral energy maxima of the speech audio; dynamic time warping (DTW), which finds an optimal alignment between two given temporal sequences under certain constraints; and a refinement process that improves the output of the DTW system. Experiments on a database of 1,248 samples validate the proposed approach, which achieves 94.5% accuracy and compares well with the state of the art.
ISSN: 2642-3596
DOI: 10.1109/CW58918.2023.00064
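
The summary names dynamic time warping as the alignment step but does not give its details. The following is a minimal sketch of textbook DTW, not the authors' implementation. It assumes one-dimensional feature tracks (for example, one formant frequency per frame), an absolute-difference local cost, and a band constraint on the warping path as one possible reading of "under certain constraints". The names `dtw_distance`, `nearest_speaker`, and `window` are hypothetical, and the numbers in the usage lines are made up for illustration.

```python
import math


def dtw_distance(a, b, window=None):
    """Cumulative DTW cost between two 1-D sequences.

    `window` is an assumed band constraint: frame i of `a` may only be
    matched to frames of `b` within `window` positions of i.
    """
    n, m = len(a), len(b)
    # The band must be at least |n - m| wide, or no path reaches the end.
    w = max(n, m) if window is None else max(window, abs(n - m))
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(1, i - w), min(m, i + w) + 1):
            d = abs(a[i - 1] - b[j - 1])  # assumed local cost
            # Extend the cheapest of the three allowed predecessor cells.
            cost[i][j] = d + min(cost[i - 1][j],
                                 cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]


def nearest_speaker(query, templates, window=None):
    """Return the label whose template has the lowest DTW cost to `query`.

    `templates` maps a speaker label to a reference feature track.
    """
    return min(templates,
               key=lambda name: dtw_distance(query, templates[name], window))


# Illustrative values only, not data from the paper.
templates = {"speaker_a": [500, 520, 540, 560], "speaker_b": [700, 690, 680]}
print(nearest_speaker([505, 518, 518, 541, 559], templates, window=2))
```

The band keeps the alignment close to the diagonal, which limits how much one sequence can be stretched against the other and reduces the number of cells filled in. A nearest-template rule is used here only to show how DTW costs could feed a decision; the record does not say how the paper's classification or refinement steps work.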