
Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE

In this paper, the fusion of two speaker recognition subsystems, one based on frequency modulation (FM) and another on MFCC features, is reported. The motivation for their fusion was to improve the recognition accuracy across different types of channel variations, since the two features are believed...


Bibliographic Details
Main Authors: Nosratighods, M., Thiruvaran, T., Epps, J., Ambikairajah, E., Bin Ma, Haizhou Li
Format: Conference Proceeding
Language: English
container_end_page 4236
container_start_page 4233
creator Nosratighods, M.
Thiruvaran, T.
Epps, J.
Ambikairajah, E.
Bin Ma
Haizhou Li
description In this paper, the fusion of two speaker recognition subsystems, one based on frequency modulation (FM) and the other on MFCC features, is reported. The motivation for their fusion was to improve the recognition accuracy across different types of channel variations, since the two features are believed to contain complementary information. It was found that the MFCC-based subsystem outperformed the FM-based subsystem on telephone conversations from the NIST SRE-06 dataset, while the opposite was true for NIST SRE-08 telephone data. As a result, the FM-based subsystem performed as well as the MFCC-based subsystem, and their fusion gave up to a 23% relative improvement in EER over the MFCC subsystem alone when evaluated on the NIST 2008 core condition.
doi_str_mv 10.1109/ICASSP.2009.4960563
format conference_proceeding
fulltext fulltext_linktorsrc
identifier ISSN: 1520-6149
ispartof 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009, p.4233-4236
issn 1520-6149
2379-190X
language eng
recordid cdi_ieee_primary_4960563
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Australia
Frequency estimation
Frequency modulation
Fusion
Humans
Mel frequency cepstral coefficient
MFCC
NIST
Psychoacoustic models
Resonance
Speaker recognition
Speech
title Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T11%3A08%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Evaluation%20of%20a%20fused%20FM%20and%20cepstral-based%20speaker%20recognition%20system%20on%20the%20NIST%202008%20SRE&rft.btitle=2009%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing&rft.au=Nosratighods,%20M.&rft.date=2009-01-01&rft.spage=4233&rft.epage=4236&rft.pages=4233-4236&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424423538&rft.isbn_list=1424423538&rft_id=info:doi/10.1109/ICASSP.2009.4960563&rft.eisbn=9781424423545&rft.eisbn_list=1424423546&rft_dat=%3Cieee_6IE%3E4960563%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i220t-4dec016546d4891f1466e8af46d00f53a4c3ca1ecca45b85da9be1e42d6ecf573%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4960563&rfr_iscdi=true