Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA

This paper proposes a high speech quality noise suppression method based on weighted noise estimation and MMSE STSA. The proposed method continuously updates the noise estimate, using weighted noisy speech according to the estimated speech‐to‐noise ratio. In order to fully utilize the improvement of...

Bibliographic Details
Published in: Electronics & communications in Japan. Part 3, Fundamental electronic science, 2006-02, Vol.89 (2), p.43-53
Main Authors: Kato, Masanori; Sugiyama, Akihiko; Serizawa, Masahiro
Format: Article
Language: English
Subjects: noise estimation; noise suppression; short-time amplitude spectrum; speech emphasis
container_end_page 53
container_issue 2
container_start_page 43
container_title Electronics & communications in Japan. Part 3, Fundamental electronic science
container_volume 89
creator Kato, Masanori
Sugiyama, Akihiko
Serizawa, Masahiro
description This paper proposes a high speech quality noise suppression method based on weighted noise estimation and MMSE STSA. The proposed method continuously updates the noise estimate, using weighted noisy speech according to the estimated speech‐to‐noise ratio. In order to fully utilize the improvement offered by noise estimation, the spectral gain is corrected according to the estimated speech‐to‐noise ratio. By using accurate noise estimation, more accurate SNR than in the conventional method is obtained, which helps to reduce distortion in the enhanced speech. In subjective speech quality evaluations, the five‐stage MOS was improved by 0.35 and 0.40 at the maximum, respectively, for the cases in which the speech was encoded and was not encoded after noise suppression. The improved version, which was developed on the basis of the proposed noise suppressor, satisfies all 3GPP minimum requirements for speech quality and has been installed in a commercially available model. © 2005 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 89(2): 43–53, 2006; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjc.20145
doi_str_mv 10.1002/ecjc.20145
format article
fullrecord <record><control><sourceid>wiley_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1002_ecjc_20145</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>ECJC20145</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3115-7de5207332ac950ed77094c5560e521ad3fe6e7b03d1562abd47da180874ad183</originalsourceid><addsrcrecordid>eNp9kFFPwjAUhRujiYi--Av6bDK8XdcVHsmCqAHEDKOJD01ZL66IMNcR3L-3A_XRp3uSc76bk0PIJYMOAwivMVtmnRBYJI5Ii4kQgjiK4NhriMIAerE8JWfOLQG8FrxFXicb65C6bVGU6JzdrOnOVjnN7VtOXYGY5fRzq1e2qulcOzS0SaB3K6_XexhdZT901bB6beh4nA5oOkv75-RkoVcOL35umzzdDGbJbTB6GN4l_VGQccZEIA36opLzUGc9AWikhF6UCRGDN5g2fIExyjlww0Qc6rmJpNGsC10ZacO6vE2uDn-zcuNciQtVlL5QWSsGqplFNbOo_Sw-zA7hnV1h_U9SDZL75JcJDox1FX79Mbp8V7HkUqjnyVC9TEdsKtJHxfk3PUVz4w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA</title><source>Wiley-Blackwell Read &amp; Publish Collection</source><creator>Kato, Masanori ; Sugiyama, Akihiko ; Serizawa, Masahiro</creator><creatorcontrib>Kato, Masanori ; Sugiyama, Akihiko ; Serizawa, Masahiro</creatorcontrib><description>This paper proposes a high speech quality noise suppression method based on weighted noise estimation and MMSE STSA. The proposed method continuously updates the noise estimate, using weighted noisy speech according to the estimated speech‐to‐noise ratio. In order to fully utilize the improvement offered by noise estimation, the spectral gain is corrected according to the estimated speech‐to‐noise ratio. By using accurate noise estimation, more accurate SNR than in the conventional method is obtained, which helps to reduce distortion in the enhanced speech. 
In subjective speech quality evaluations, the five‐stage MOS was improved by 0.35 and 0.40 at the maximum, respectively, for the cases in which the speech was encoded and was not encoded after noise suppression. The improved version, which was developed on the basis of the proposed noise suppressor, satisfies all 3GPP minimum requirements for speech quality and has been installed in a commercially available model. © 2005 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 89(2): 43–53, 2006; Published online in Wiley InterScience (www.interscience. wiley.com). DOI 10.1002/ecjc.20145</description><identifier>ISSN: 1042-0967</identifier><identifier>EISSN: 1520-6440</identifier><identifier>DOI: 10.1002/ecjc.20145</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc., A Wiley Company</publisher><subject>noise estimation ; noise suppression ; short-time amplitude spectrum ; speech emphasis</subject><ispartof>Electronics &amp; communications in Japan. Part 3, Fundamental electronic science, 2006-02, Vol.89 (2), p.43-53</ispartof><rights>Copyright © 2005 Wiley Periodicals, Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3115-7de5207332ac950ed77094c5560e521ad3fe6e7b03d1562abd47da180874ad183</citedby><cites>FETCH-LOGICAL-c3115-7de5207332ac950ed77094c5560e521ad3fe6e7b03d1562abd47da180874ad183</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Kato, Masanori</creatorcontrib><creatorcontrib>Sugiyama, Akihiko</creatorcontrib><creatorcontrib>Serizawa, Masahiro</creatorcontrib><title>Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA</title><title>Electronics &amp; communications in Japan. 
Part 3, Fundamental electronic science</title><addtitle>Electron. Comm. Jpn. Pt. III</addtitle><description>This paper proposes a high speech quality noise suppression method based on weighted noise estimation and MMSE STSA. The proposed method continuously updates the noise estimate, using weighted noisy speech according to the estimated speech‐to‐noise ratio. In order to fully utilize the improvement offered by noise estimation, the spectral gain is corrected according to the estimated speech‐to‐noise ratio. By using accurate noise estimation, more accurate SNR than in the conventional method is obtained, which helps to reduce distortion in the enhanced speech. In subjective speech quality evaluations, the five‐stage MOS was improved by 0.35 and 0.40 at the maximum, respectively, for the cases in which the speech was encoded and was not encoded after noise suppression. The improved version, which was developed on the basis of the proposed noise suppressor, satisfies all 3GPP minimum requirements for speech quality and has been installed in a commercially available model. © 2005 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 89(2): 43–53, 2006; Published online in Wiley InterScience (www.interscience. wiley.com). 
DOI 10.1002/ecjc.20145</description><subject>noise estimation</subject><subject>noise suppression</subject><subject>short-time amplitude spectrum</subject><subject>speech emphasis</subject><issn>1042-0967</issn><issn>1520-6440</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><recordid>eNp9kFFPwjAUhRujiYi--Av6bDK8XdcVHsmCqAHEDKOJD01ZL66IMNcR3L-3A_XRp3uSc76bk0PIJYMOAwivMVtmnRBYJI5Ii4kQgjiK4NhriMIAerE8JWfOLQG8FrxFXicb65C6bVGU6JzdrOnOVjnN7VtOXYGY5fRzq1e2qulcOzS0SaB3K6_XexhdZT901bB6beh4nA5oOkv75-RkoVcOL35umzzdDGbJbTB6GN4l_VGQccZEIA36opLzUGc9AWikhF6UCRGDN5g2fIExyjlww0Qc6rmJpNGsC10ZacO6vE2uDn-zcuNciQtVlL5QWSsGqplFNbOo_Sw-zA7hnV1h_U9SDZL75JcJDox1FX79Mbp8V7HkUqjnyVC9TEdsKtJHxfk3PUVz4w</recordid><startdate>200602</startdate><enddate>200602</enddate><creator>Kato, Masanori</creator><creator>Sugiyama, Akihiko</creator><creator>Serizawa, Masahiro</creator><general>Wiley Subscription Services, Inc., A Wiley Company</general><scope>BSCLL</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>200602</creationdate><title>Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA</title><author>Kato, Masanori ; Sugiyama, Akihiko ; Serizawa, Masahiro</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3115-7de5207332ac950ed77094c5560e521ad3fe6e7b03d1562abd47da180874ad183</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>noise estimation</topic><topic>noise suppression</topic><topic>short-time amplitude spectrum</topic><topic>speech emphasis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kato, Masanori</creatorcontrib><creatorcontrib>Sugiyama, Akihiko</creatorcontrib><creatorcontrib>Serizawa, Masahiro</creatorcontrib><collection>Istex</collection><collection>CrossRef</collection><jtitle>Electronics 
&amp; communications in Japan. Part 3, Fundamental electronic science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kato, Masanori</au><au>Sugiyama, Akihiko</au><au>Serizawa, Masahiro</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA</atitle><jtitle>Electronics &amp; communications in Japan. Part 3, Fundamental electronic science</jtitle><addtitle>Electron. Comm. Jpn. Pt. III</addtitle><date>2006-02</date><risdate>2006</risdate><volume>89</volume><issue>2</issue><spage>43</spage><epage>53</epage><pages>43-53</pages><issn>1042-0967</issn><eissn>1520-6440</eissn><abstract>This paper proposes a high speech quality noise suppression method based on weighted noise estimation and MMSE STSA. The proposed method continuously updates the noise estimate, using weighted noisy speech according to the estimated speech‐to‐noise ratio. In order to fully utilize the improvement offered by noise estimation, the spectral gain is corrected according to the estimated speech‐to‐noise ratio. By using accurate noise estimation, more accurate SNR than in the conventional method is obtained, which helps to reduce distortion in the enhanced speech. In subjective speech quality evaluations, the five‐stage MOS was improved by 0.35 and 0.40 at the maximum, respectively, for the cases in which the speech was encoded and was not encoded after noise suppression. The improved version, which was developed on the basis of the proposed noise suppressor, satisfies all 3GPP minimum requirements for speech quality and has been installed in a commercially available model. © 2005 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 89(2): 43–53, 2006; Published online in Wiley InterScience (www.interscience. wiley.com). 
DOI 10.1002/ecjc.20145</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc., A Wiley Company</pub><doi>10.1002/ecjc.20145</doi><tpages>11</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1042-0967
ispartof Electronics & communications in Japan. Part 3, Fundamental electronic science, 2006-02, Vol.89 (2), p.43-53
issn 1042-0967
1520-6440
language eng
recordid cdi_crossref_primary_10_1002_ecjc_20145
source Wiley-Blackwell Read & Publish Collection
subjects noise estimation
noise suppression
short-time amplitude spectrum
speech emphasis
title Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T06%3A07%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wiley_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Noise%20suppression%20with%20high%20speech%20quality%20based%20on%20weighted%20noise%20estimation%20and%20MMSE%20STSA&rft.jtitle=Electronics%20&%20communications%20in%20Japan.%20Part%203,%20Fundamental%20electronic%20science&rft.au=Kato,%20Masanori&rft.date=2006-02&rft.volume=89&rft.issue=2&rft.spage=43&rft.epage=53&rft.pages=43-53&rft.issn=1042-0967&rft.eissn=1520-6440&rft_id=info:doi/10.1002/ecjc.20145&rft_dat=%3Cwiley_cross%3EECJC20145%3C/wiley_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c3115-7de5207332ac950ed77094c5560e521ad3fe6e7b03d1562abd47da180874ad183%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true