Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE
In this paper, the fusion of two speaker recognition subsystems, one based on frequency modulation (FM) and another on MFCC features, is reported. The motivation for their fusion was to improve the recognition accuracy across different types of channel variations, since the two features are believed...
Main Authors: | Nosratighods, M.; Thiruvaran, T.; Epps, J.; Ambikairajah, E.; Bin Ma; Haizhou Li |
---|---|
Format: | Conference Proceeding |
Language: | English |
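Subjects: | Australia; Frequency estimation; Frequency modulation; Fusion; Humans; Mel frequency cepstral coefficient; MFCC; NIST; Psychoacoustic models; Resonance; Speaker recognition; Speech |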
Online Access: | Request full text |
cited_by | |
---|---|
container_end_page | 4236 |
container_start_page | 4233 |
creator | Nosratighods, M.; Thiruvaran, T.; Epps, J.; Ambikairajah, E.; Bin Ma; Haizhou Li |
description | In this paper, the fusion of two speaker recognition subsystems, one based on frequency modulation (FM) features and the other on MFCC features, is reported. The motivation for their fusion was to improve the recognition accuracy across different types of channel variations, since the two features are believed to contain complementary information. It was found that the MFCC-based subsystem outperformed the FM-based subsystem on telephone conversations from the NIST SRE-06 dataset, while the opposite was true for NIST SRE-08 telephone data. As a result, the FM-based subsystem performed as well as the MFCC-based subsystem and their fusion gave up to 23% relative improvement in terms of EER over the MFCC subsystem alone, when evaluated on the NIST 2008 core condition. |
doi_str_mv | 10.1109/ICASSP.2009.4960563 |
format | conference_proceeding |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-6149 |
ispartof | 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009, p.4233-4236 |
issn | 1520-6149; 2379-190X |
language | eng |
recordid | cdi_ieee_primary_4960563 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Australia; Frequency estimation; Frequency modulation; Fusion; Humans; Mel frequency cepstral coefficient; MFCC; NIST; Psychoacoustic models; Resonance; Speaker recognition; Speech |
title | Evaluation of a fused FM and cepstral-based speaker recognition system on the NIST 2008 SRE |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T11%3A08%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Evaluation%20of%20a%20fused%20FM%20and%20cepstral-based%20speaker%20recognition%20system%20on%20the%20NIST%202008%20SRE&rft.btitle=2009%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing&rft.au=Nosratighods,%20M.&rft.date=2009-01-01&rft.spage=4233&rft.epage=4236&rft.pages=4233-4236&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424423538&rft.isbn_list=1424423538&rft_id=info:doi/10.1109/ICASSP.2009.4960563&rft.eisbn=9781424423545&rft.eisbn_list=1424423546&rft_dat=%3Cieee_6IE%3E4960563%3C/ieee_6IE%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i220t-4dec016546d4891f1466e8af46d00f53a4c3ca1ecca45b85da9be1e42d6ecf573%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4960563&rfr_iscdi=true |