Hybrid approach for speaker recognition based on formant and pitch extraction
Main Authors: | , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Summary: | The human voice is an ideal data source for identifying people in many applications. With the growing need for security in public places, voice biometrics may be a good solution because voice recordings are easy to capture. This paper gives a brief overview of the approaches used for speaker recognition and then presents a novel approach for recognizing speakers in degraded smart-home conditions. The proposed approach comprises a pre-processing phase, a feature extraction phase, and a classification phase. The feature extraction phase consists of formant extraction to obtain the spectral energy maxima of the speech audio, dynamic time warping (DTW) to find an optimal alignment between two given temporal sequences under certain restrictions, and a refinement process to improve the output of the DTW system. To validate the approach, experiments are carried out on a database of 1,248 samples. The approach achieves 94.5% accuracy, which compares well with the state of the art. |
---|---|
ISSN: | 2642-3596 |
DOI: | 10.1109/CW58918.2023.00064 |
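
The summary describes DTW as finding an optimal alignment between two temporal sequences under restrictions. As a rough illustration only, the following is a minimal sketch of the textbook DTW recurrence in Python. It is not the paper's implementation: the function name `dtw_distance`, the absolute-difference local cost, and the optional `window` parameter (a Sakoe-Chiba band, one common kind of restriction) are all assumptions, since the record does not say which constraints or features the authors use.

```python
import math


def dtw_distance(a, b, window=None):
    """Accumulated DTW cost between two numeric sequences.

    Hypothetical helper for illustration. `window` limits how far the
    alignment may stray from the diagonal (Sakoe-Chiba band); None
    means no band constraint.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")

    # The band must be at least |n - m| wide or no path can reach the end.
    w = max(n, m) if window is None else max(window, abs(n - m))

    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    for i in range(1, n + 1):
        for j in range(max(1, i - w), min(m, i + w) + 1):
            local = abs(a[i - 1] - b[j - 1])  # assumed local distance
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]
```

For example, `dtw_distance([1, 2, 3], [1, 2, 2, 3])` is zero because the repeated `2` can be absorbed by the warping, whereas a point-by-point comparison would not even be defined for sequences of different lengths. In a speaker-recognition setting the inputs would presumably be per-frame feature tracks (such as formant or pitch values) rather than toy integers.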