Loading…

Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion

A blind approach for estimating the signal to noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method is based on a pattern recognition paradigm using various linear predictive based features, a vector quantizer classifier and estimation combination. Blind SNR estima...

Full description

Saved in:

Bibliographic Details
Published in:	Journal of signal processing systems 2017-11, Vol.89 (2), p.335-345
Main Authors:	Ondusko, Russell, Marbach, Matthew, Ramachandran, Ravi P., Head, Linda M.
Format:	Article
Language:	English
Subjects:	Bandpass Circuits and Systems Classifiers Computer Imaging Electrical Engineering Engineering Errors Identification systems Image Processing and Computer Vision Military vehicles Pattern Recognition Pattern Recognition and Graphics Signal to noise ratio Signal,Image and Speech Processing Speech Speech processing Vision
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by	cdi_FETCH-LOGICAL-c316t-5add76ff242a27728f3817833f0b2c4098699965efab4b0f4cb3c171bfdae5003
cites	cdi_FETCH-LOGICAL-c316t-5add76ff242a27728f3817833f0b2c4098699965efab4b0f4cb3c171bfdae5003
container_end_page	345
container_issue	2
container_start_page	335
container_title	Journal of signal processing systems
container_volume	89
creator	Ondusko, Russell Marbach, Matthew Ramachandran, Ravi P. Head, Linda M.
description	A blind approach for estimating the signal to noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method is based on a pattern recognition paradigm using various linear predictive based features, a vector quantizer classifier and estimation combination. Blind SNR estimation is very useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system and SNR estimation is very important in evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Experiments consider (1) artificially generated additive white Gaussian noise, pink noise and bandpass noise and (2) fifteen noise types from the NOISEX database. Four features are combined to get the best results. The average SNR estimation error depends on the type of noise in that a relatively low error results for pink noise and jet cockpit noise and a high error results for destroyer operations room noise and military vehicle noise. For both artificially generated noise and the NOISEX data, the error is lower than what is achieved by the IMCRA method that uses SNR estimation for speech enhancement. Combining the four features with IMCRA lowers the error for 8 of the 15 noise types from NOISEX.
doi_str_mv	10.1007/s11265-016-1200-z
format	article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1930131164</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1930131164</sourcerecordid><originalsourceid>FETCH-LOGICAL-c316t-5add76ff242a27728f3817833f0b2c4098699965efab4b0f4cb3c171bfdae5003</originalsourceid><addsrcrecordid>eNp1kEtLAzEUhYMoWKs_wF3AdfTezHtpq1WhKFp1GzKZpKaMMzWZEeyvN8MouHF1H5xz4HyEnCKcI0B24RF5mjDAlCEHYLs9MsEiKliOmOz_7oD5ITnyfgOQQpbghGxntW0qurLrRtasa9l9a72mT7KzLb32nX0ftoa2hq62Wqs3OpNeVzS8XrXqWkcfe9l0dqcdndfSe2usdp7KEHqllfWDeak_dU0X_XAckwMja69PfuaUvCyun-e3bPlwcze_XDIVYdqxRFZVlhrDYy55lvHcRDlmeRQZKLmKocjToijSRBtZxiWYWJWRwgxLU0mdAERTcjbmbl370WvfiU3bu1DSi8ACMEJM46DCUaVc673TRmxdqOy-BIIYwIoRrAhgxQBW7IKHjx4ftM1auz_J_5q-AYqce-A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1930131164</pqid></control><display><type>article</type><title>Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion</title><source>Springer Link</source><creator>Ondusko, Russell ; Marbach, Matthew ; Ramachandran, Ravi P. ; Head, Linda M.</creator><creatorcontrib>Ondusko, Russell ; Marbach, Matthew ; Ramachandran, Ravi P. ; Head, Linda M.</creatorcontrib><description>A blind approach for estimating the signal to noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method is based on a pattern recognition paradigm using various linear predictive based features, a vector quantizer classifier and estimation combination. Blind SNR estimation is very useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system and SNR estimation is very important in evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Experiments consider (1) artificially generated additive white Gaussian noise, pink noise and bandpass noise and (2) fifteen noise types from the NOISEX database. Four features are combined to get the best results. The average SNR estimation error depends on the type of noise in that a relatively low error results for pink noise and jet cockpit noise and a high error results for destroyer operations room noise and military vehicle noise. For both artificially generated noise and the NOISEX data, the error is lower than what is achieved by the IMCRA method that uses SNR estimation for speech enhancement. Combining the four features with IMCRA lowers the error for 8 of the 15 noise types from NOISEX.</description><identifier>ISSN: 1939-8018</identifier><identifier>EISSN: 1939-8115</identifier><identifier>DOI: 10.1007/s11265-016-1200-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Bandpass ; Circuits and Systems ; Classifiers ; Computer Imaging ; Electrical Engineering ; Engineering ; Errors ; Identification systems ; Image Processing and Computer Vision ; Military vehicles ; Pattern Recognition ; Pattern Recognition and Graphics ; Signal to noise ratio ; Signal,Image and Speech Processing ; Speech ; Speech processing ; Vision</subject><ispartof>Journal of signal processing systems, 2017-11, Vol.89 (2), p.335-345</ispartof><rights>Springer Science+Business Media New York 2016</rights><rights>Copyright Springer Science & Business Media 2017</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c316t-5add76ff242a27728f3817833f0b2c4098699965efab4b0f4cb3c171bfdae5003</citedby><cites>FETCH-LOGICAL-c316t-5add76ff242a27728f3817833f0b2c4098699965efab4b0f4cb3c171bfdae5003</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Ondusko, Russell</creatorcontrib><creatorcontrib>Marbach, Matthew</creatorcontrib><creatorcontrib>Ramachandran, Ravi P.</creatorcontrib><creatorcontrib>Head, Linda M.</creatorcontrib><title>Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion</title><title>Journal of signal processing systems</title><addtitle>J Sign Process Syst</addtitle><description>A blind approach for estimating the signal to noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method is based on a pattern recognition paradigm using various linear predictive based features, a vector quantizer classifier and estimation combination. Blind SNR estimation is very useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system and SNR estimation is very important in evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Experiments consider (1) artificially generated additive white Gaussian noise, pink noise and bandpass noise and (2) fifteen noise types from the NOISEX database. Four features are combined to get the best results. The average SNR estimation error depends on the type of noise in that a relatively low error results for pink noise and jet cockpit noise and a high error results for destroyer operations room noise and military vehicle noise. For both artificially generated noise and the NOISEX data, the error is lower than what is achieved by the IMCRA method that uses SNR estimation for speech enhancement. Combining the four features with IMCRA lowers the error for 8 of the 15 noise types from NOISEX.</description><subject>Bandpass</subject><subject>Circuits and Systems</subject><subject>Classifiers</subject><subject>Computer Imaging</subject><subject>Electrical Engineering</subject><subject>Engineering</subject><subject>Errors</subject><subject>Identification systems</subject><subject>Image Processing and Computer Vision</subject><subject>Military vehicles</subject><subject>Pattern Recognition</subject><subject>Pattern Recognition and Graphics</subject><subject>Signal to noise ratio</subject><subject>Signal,Image and Speech Processing</subject><subject>Speech</subject><subject>Speech processing</subject><subject>Vision</subject><issn>1939-8018</issn><issn>1939-8115</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNp1kEtLAzEUhYMoWKs_wF3AdfTezHtpq1WhKFp1GzKZpKaMMzWZEeyvN8MouHF1H5xz4HyEnCKcI0B24RF5mjDAlCEHYLs9MsEiKliOmOz_7oD5ITnyfgOQQpbghGxntW0qurLrRtasa9l9a72mT7KzLb32nX0ftoa2hq62Wqs3OpNeVzS8XrXqWkcfe9l0dqcdndfSe2usdp7KEHqllfWDeak_dU0X_XAckwMja69PfuaUvCyun-e3bPlwcze_XDIVYdqxRFZVlhrDYy55lvHcRDlmeRQZKLmKocjToijSRBtZxiWYWJWRwgxLU0mdAERTcjbmbl370WvfiU3bu1DSi8ACMEJM46DCUaVc673TRmxdqOy-BIIYwIoRrAhgxQBW7IKHjx4ftM1auz_J_5q-AYqce-A</recordid><startdate>20171101</startdate><enddate>20171101</enddate><creator>Ondusko, Russell</creator><creator>Marbach, Matthew</creator><creator>Ramachandran, Ravi P.</creator><creator>Head, Linda M.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20171101</creationdate><title>Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion</title><author>Ondusko, Russell ; Marbach, Matthew ; Ramachandran, Ravi P. ; Head, Linda M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c316t-5add76ff242a27728f3817833f0b2c4098699965efab4b0f4cb3c171bfdae5003</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Bandpass</topic><topic>Circuits and Systems</topic><topic>Classifiers</topic><topic>Computer Imaging</topic><topic>Electrical Engineering</topic><topic>Engineering</topic><topic>Errors</topic><topic>Identification systems</topic><topic>Image Processing and Computer Vision</topic><topic>Military vehicles</topic><topic>Pattern Recognition</topic><topic>Pattern Recognition and Graphics</topic><topic>Signal to noise ratio</topic><topic>Signal,Image and Speech Processing</topic><topic>Speech</topic><topic>Speech processing</topic><topic>Vision</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ondusko, Russell</creatorcontrib><creatorcontrib>Marbach, Matthew</creatorcontrib><creatorcontrib>Ramachandran, Ravi P.</creatorcontrib><creatorcontrib>Head, Linda M.</creatorcontrib><collection>CrossRef</collection><jtitle>Journal of signal processing systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ondusko, Russell</au><au>Marbach, Matthew</au><au>Ramachandran, Ravi P.</au><au>Head, Linda M.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion</atitle><jtitle>Journal of signal processing systems</jtitle><stitle>J Sign Process Syst</stitle><date>2017-11-01</date><risdate>2017</risdate><volume>89</volume><issue>2</issue><spage>335</spage><epage>345</epage><pages>335-345</pages><issn>1939-8018</issn><eissn>1939-8115</eissn><abstract>A blind approach for estimating the signal to noise ratio (SNR) of a speech signal corrupted by additive noise is proposed. The method is based on a pattern recognition paradigm using various linear predictive based features, a vector quantizer classifier and estimation combination. Blind SNR estimation is very useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system and SNR estimation is very important in evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Experiments consider (1) artificially generated additive white Gaussian noise, pink noise and bandpass noise and (2) fifteen noise types from the NOISEX database. Four features are combined to get the best results. The average SNR estimation error depends on the type of noise in that a relatively low error results for pink noise and jet cockpit noise and a high error results for destroyer operations room noise and military vehicle noise. For both artificially generated noise and the NOISEX data, the error is lower than what is achieved by the IMCRA method that uses SNR estimation for speech enhancement. Combining the four features with IMCRA lowers the error for 8 of the 15 noise types from NOISEX.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11265-016-1200-z</doi><tpages>11</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1939-8018
ispartof	Journal of signal processing systems, 2017-11, Vol.89 (2), p.335-345
issn	1939-8018 1939-8115
language	eng
recordid	cdi_proquest_journals_1930131164
source	Springer Link
subjects	Bandpass Circuits and Systems Classifiers Computer Imaging Electrical Engineering Engineering Errors Identification systems Image Processing and Computer Vision Military vehicles Pattern Recognition Pattern Recognition and Graphics Signal to noise ratio Signal,Image and Speech Processing Speech Speech processing Vision
title	Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T06%3A16%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Blind%20Signal-to-Noise%20Ratio%20Estimation%20of%20Speech%20Based%20on%20Vector%20Quantizer%20Classifiers%20and%20Decision%20Level%20Fusion&rft.jtitle=Journal%20of%20signal%20processing%20systems&rft.au=Ondusko,%20Russell&rft.date=2017-11-01&rft.volume=89&rft.issue=2&rft.spage=335&rft.epage=345&rft.pages=335-345&rft.issn=1939-8018&rft.eissn=1939-8115&rft_id=info:doi/10.1007/s11265-016-1200-z&rft_dat=%3Cproquest_cross%3E1930131164%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c316t-5add76ff242a27728f3817833f0b2c4098699965efab4b0f4cb3c171bfdae5003%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1930131164&rft_id=info:pmid/&rfr_iscdi=true