Loading…
A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners
The SII model in its present form (ANSI S3.5-1997, American National Standards Institute, New York) can accurately describe intelligibility for speech in stationary noise but fails to do so for nonstationary noise maskers. Here, an extension to the SII model is proposed with the aim to predict the s...
Saved in:
Published in: | The Journal of the Acoustical Society of America 2005-04, Vol.117 (4), p.2181-2192 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c379t-c60f216128b05921da0931563edc90f1a418d3ac34619f1df51adb64dd644b163 |
---|---|
cites | cdi_FETCH-LOGICAL-c379t-c60f216128b05921da0931563edc90f1a418d3ac34619f1df51adb64dd644b163 |
container_end_page | 2192 |
container_issue | 4 |
container_start_page | 2181 |
container_title | The Journal of the Acoustical Society of America |
container_volume | 117 |
creator | RHEBERGEN, Koenraad S VERSFELD, Niek J |
description | The SII model in its present form (ANSI S3.5-1997, American National Standards Institute, New York) can accurately describe intelligibility for speech in stationary noise but fails to do so for nonstationary noise maskers. Here, an extension to the SII model is proposed with the aim to predict the speech intelligibility in both stationary and fluctuating noise. The basic principle of the present approach is that both speech and noise signal are partitioned into small time frames. Within each time frame the conventional SII is determined, yielding the speech information available to the listener at that time frame. Next, the SII values of these time frames are averaged, resulting in the SII for that particular condition. Using speech reception threshold (SRT) data from the literature, the extension to the present SII model can give a good account for SRTs in stationary noise, fluctuating speech noise, interrupted noise, and multiple-talker noise. The predictions for sinusoidally intensity modulated (SIM) noise and real speech or speech-like maskers are better than with the original SII model, but are still not accurate. For the latter type of maskers, informational masking may play a role. |
doi_str_mv | 10.1121/1.1861713 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_85641736</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>85641736</sourcerecordid><originalsourceid>FETCH-LOGICAL-c379t-c60f216128b05921da0931563edc90f1a418d3ac34619f1df51adb64dd644b163</originalsourceid><addsrcrecordid>eNpFkc9uFDEMxqOKim4LB14A5QIShynx5M9OjlUFFKkSl3IeZRKnG5SdDElGok_C65LtLqp8sGz__En-TMg7YNcAPXyGaxgUbIGfkQ3InnWD7MUrsmGMQSe0UhfkspRfrZQD16_JBchBD0rqDfl7Q8uCaHc0zBVjDI9hCjHUp1Y7_NNNpqCjZllyMg2qiS4ZXbCV1h3-X81ocakhza2ZsexSdNSnTAs2zdliaWLUx9XW1dQwP9I5hYLPyJzy3sRuhyYfBjGUtoG5vCHn3sSCb0_5ivz8-uXh9q67__Ht--3NfWf5VtfOKuZ7UNAPE5O6B2eY5iAVR2c182AEDI4by4UC7cF5CcZNSjinhJhA8Svy8ajbDvy9YqnjPhTbjDAzprWMg1QCtvwAfjqCNqdSMvpxyWFv8tMIbDx8YWxx_EJj359E12mP7oU82d6ADyfAFGuiz2a2obxwatuLQQL_B21rkb0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>85641736</pqid></control><display><type>article</type><title>A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><source>Linguistics and Language Behavior Abstracts (LLBA)</source><creator>RHEBERGEN, Koenraad S ; VERSFELD, Niek J</creator><creatorcontrib>RHEBERGEN, Koenraad S ; VERSFELD, Niek J</creatorcontrib><description>The SII model in its present form (ANSI S3.5-1997, American National Standards Institute, New York) can accurately describe intelligibility for speech in stationary noise but fails to do so for nonstationary noise maskers. Here, an extension to the SII model is proposed with the aim to predict the speech intelligibility in both stationary and fluctuating noise. The basic principle of the present approach is that both speech and noise signal are partitioned into small time frames. Within each time frame the conventional SII is determined, yielding the speech information available to the listener at that time frame. Next, the SII values of these time frames are averaged, resulting in the SII for that particular condition. Using speech reception threshold (SRT) data from the literature, the extension to the present SII model can give a good account for SRTs in stationary noise, fluctuating speech noise, interrupted noise, and multiple-talker noise. The predictions for sinusoidally intensity modulated (SIM) noise and real speech or speech-like maskers are better than with the original SII model, but are still not accurate. For the latter type of maskers, informational masking may play a role.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.1861713</identifier><identifier>PMID: 15898659</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><publisher>Woodbury, NY: Acoustical Society of America</publisher><subject>Acoustics ; Attention ; Audition ; Auditory Threshold ; Biological and medical sciences ; Exact sciences and technology ; Fundamental and applied biological sciences. Psychology ; Fundamental areas of phenomenology (including applications) ; Humans ; Mathematical Computing ; Noise ; Perception ; Perceptual Masking ; Physics ; Psychoacoustics ; Psychology. Psychoanalysis. Psychiatry ; Psychology. Psychophysiology ; Reference Values ; Signal Processing, Computer-Assisted ; Sound Localization ; Sound Spectrography ; Speech Perception ; Speech Reception Threshold Test</subject><ispartof>The Journal of the Acoustical Society of America, 2005-04, Vol.117 (4), p.2181-2192</ispartof><rights>2005 INIST-CNRS</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c379t-c60f216128b05921da0931563edc90f1a418d3ac34619f1df51adb64dd644b163</citedby><cites>FETCH-LOGICAL-c379t-c60f216128b05921da0931563edc90f1a418d3ac34619f1df51adb64dd644b163</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906,31251</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=16724851$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/15898659$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>RHEBERGEN, Koenraad S</creatorcontrib><creatorcontrib>VERSFELD, Niek J</creatorcontrib><title>A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners</title><title>The Journal of the Acoustical Society of America</title><addtitle>J Acoust Soc Am</addtitle><description>The SII model in its present form (ANSI S3.5-1997, American National Standards Institute, New York) can accurately describe intelligibility for speech in stationary noise but fails to do so for nonstationary noise maskers. Here, an extension to the SII model is proposed with the aim to predict the speech intelligibility in both stationary and fluctuating noise. The basic principle of the present approach is that both speech and noise signal are partitioned into small time frames. Within each time frame the conventional SII is determined, yielding the speech information available to the listener at that time frame. Next, the SII values of these time frames are averaged, resulting in the SII for that particular condition. Using speech reception threshold (SRT) data from the literature, the extension to the present SII model can give a good account for SRTs in stationary noise, fluctuating speech noise, interrupted noise, and multiple-talker noise. The predictions for sinusoidally intensity modulated (SIM) noise and real speech or speech-like maskers are better than with the original SII model, but are still not accurate. For the latter type of maskers, informational masking may play a role.</description><subject>Acoustics</subject><subject>Attention</subject><subject>Audition</subject><subject>Auditory Threshold</subject><subject>Biological and medical sciences</subject><subject>Exact sciences and technology</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Fundamental areas of phenomenology (including applications)</subject><subject>Humans</subject><subject>Mathematical Computing</subject><subject>Noise</subject><subject>Perception</subject><subject>Perceptual Masking</subject><subject>Physics</subject><subject>Psychoacoustics</subject><subject>Psychology. Psychoanalysis. Psychiatry</subject><subject>Psychology. Psychophysiology</subject><subject>Reference Values</subject><subject>Signal Processing, Computer-Assisted</subject><subject>Sound Localization</subject><subject>Sound Spectrography</subject><subject>Speech Perception</subject><subject>Speech Reception Threshold Test</subject><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><sourceid>7T9</sourceid><recordid>eNpFkc9uFDEMxqOKim4LB14A5QIShynx5M9OjlUFFKkSl3IeZRKnG5SdDElGok_C65LtLqp8sGz__En-TMg7YNcAPXyGaxgUbIGfkQ3InnWD7MUrsmGMQSe0UhfkspRfrZQD16_JBchBD0rqDfl7Q8uCaHc0zBVjDI9hCjHUp1Y7_NNNpqCjZllyMg2qiS4ZXbCV1h3-X81ocakhza2ZsexSdNSnTAs2zdliaWLUx9XW1dQwP9I5hYLPyJzy3sRuhyYfBjGUtoG5vCHn3sSCb0_5ivz8-uXh9q67__Ht--3NfWf5VtfOKuZ7UNAPE5O6B2eY5iAVR2c182AEDI4by4UC7cF5CcZNSjinhJhA8Svy8ajbDvy9YqnjPhTbjDAzprWMg1QCtvwAfjqCNqdSMvpxyWFv8tMIbDx8YWxx_EJj359E12mP7oU82d6ADyfAFGuiz2a2obxwatuLQQL_B21rkb0</recordid><startdate>20050401</startdate><enddate>20050401</enddate><creator>RHEBERGEN, Koenraad S</creator><creator>VERSFELD, Niek J</creator><general>Acoustical Society of America</general><general>American Institute of Physics</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope></search><sort><creationdate>20050401</creationdate><title>A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners</title><author>RHEBERGEN, Koenraad S ; VERSFELD, Niek J</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c379t-c60f216128b05921da0931563edc90f1a418d3ac34619f1df51adb64dd644b163</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Acoustics</topic><topic>Attention</topic><topic>Audition</topic><topic>Auditory Threshold</topic><topic>Biological and medical sciences</topic><topic>Exact sciences and technology</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Fundamental areas of phenomenology (including applications)</topic><topic>Humans</topic><topic>Mathematical Computing</topic><topic>Noise</topic><topic>Perception</topic><topic>Perceptual Masking</topic><topic>Physics</topic><topic>Psychoacoustics</topic><topic>Psychology. Psychoanalysis. Psychiatry</topic><topic>Psychology. Psychophysiology</topic><topic>Reference Values</topic><topic>Signal Processing, Computer-Assisted</topic><topic>Sound Localization</topic><topic>Sound Spectrography</topic><topic>Speech Perception</topic><topic>Speech Reception Threshold Test</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>RHEBERGEN, Koenraad S</creatorcontrib><creatorcontrib>VERSFELD, Niek J</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>RHEBERGEN, Koenraad S</au><au>VERSFELD, Niek J</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><addtitle>J Acoust Soc Am</addtitle><date>2005-04-01</date><risdate>2005</risdate><volume>117</volume><issue>4</issue><spage>2181</spage><epage>2192</epage><pages>2181-2192</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>The SII model in its present form (ANSI S3.5-1997, American National Standards Institute, New York) can accurately describe intelligibility for speech in stationary noise but fails to do so for nonstationary noise maskers. Here, an extension to the SII model is proposed with the aim to predict the speech intelligibility in both stationary and fluctuating noise. The basic principle of the present approach is that both speech and noise signal are partitioned into small time frames. Within each time frame the conventional SII is determined, yielding the speech information available to the listener at that time frame. Next, the SII values of these time frames are averaged, resulting in the SII for that particular condition. Using speech reception threshold (SRT) data from the literature, the extension to the present SII model can give a good account for SRTs in stationary noise, fluctuating speech noise, interrupted noise, and multiple-talker noise. The predictions for sinusoidally intensity modulated (SIM) noise and real speech or speech-like maskers are better than with the original SII model, but are still not accurate. For the latter type of maskers, informational masking may play a role.</abstract><cop>Woodbury, NY</cop><pub>Acoustical Society of America</pub><pmid>15898659</pmid><doi>10.1121/1.1861713</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0001-4966 |
ispartof | The Journal of the Acoustical Society of America, 2005-04, Vol.117 (4), p.2181-2192 |
issn | 0001-4966 1520-8524 |
language | eng |
recordid | cdi_proquest_miscellaneous_85641736 |
source | American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list); Linguistics and Language Behavior Abstracts (LLBA) |
subjects | Acoustics Attention Audition Auditory Threshold Biological and medical sciences Exact sciences and technology Fundamental and applied biological sciences. Psychology Fundamental areas of phenomenology (including applications) Humans Mathematical Computing Noise Perception Perceptual Masking Physics Psychoacoustics Psychology. Psychoanalysis. Psychiatry Psychology. Psychophysiology Reference Values Signal Processing, Computer-Assisted Sound Localization Sound Spectrography Speech Perception Speech Reception Threshold Test |
title | A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T11%3A42%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20speech%20intelligibility%20index-based%20approach%20to%20predict%20the%20speech%20reception%20threshold%20for%20sentences%20in%20fluctuating%20noise%20for%20normal-hearing%20listeners&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=RHEBERGEN,%20Koenraad%20S&rft.date=2005-04-01&rft.volume=117&rft.issue=4&rft.spage=2181&rft.epage=2192&rft.pages=2181-2192&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.1861713&rft_dat=%3Cproquest_cross%3E85641736%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c379t-c60f216128b05921da0931563edc90f1a418d3ac34619f1df51adb64dd644b163%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=85641736&rft_id=info:pmid/15898659&rfr_iscdi=true |