Loading…

Application of multidimensional scaling to subjective evaluation of coded speech

We present results from a pilot study directed at developing an anchorable subjective speech quality test. The test uses multidimensional scaling techniques to obtain quantitative information about the perceptual attributes of speech. In the first phase of the study, subjects ranked perceptual dista...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of the Acoustical Society of America 2001-10, Vol.110 (4), p.2167-2182
Main Author: Hall, J L
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343
cites cdi_FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343
container_end_page 2182
container_issue 4
container_start_page 2167
container_title The Journal of the Acoustical Society of America
container_volume 110
creator Hall, J L
description We present results from a pilot study directed at developing an anchorable subjective speech quality test. The test uses multidimensional scaling techniques to obtain quantitative information about the perceptual attributes of speech. In the first phase of the study, subjects ranked perceptual distances between samples of speech produced by two different talkers, one male and one female, processed by a variety of codecs. The resulting distance matrices were processed to obtain, for each talker, a stimulus space for the various speech samples. This stimulus space has the properties that distances between stimuli in this space correspond to perceptual distances between stimuli and that the dimensions of this space correspond to attributes used by the subjects in determining perceptual distances. Mean opinion scores (MOS) scores obtained in an earlier study were found to be highly correlated with position in the stimulus space, and the three dimensions of the stimulus space were found to have identifiable physical and perceptual correlates. In the second phase of the study, we developed techniques for fitting speech generated by a new codec under investigation into a previously established stimulus space. The user is provided with a collection of speech samples and with the stimulus space for these speech samples as determined by a large-scale listening test. The user then carries out a much smaller listening test to determine the position of the new stimulus in the previously established stimulus space. This system is anchorable, so that different versions of a codec under development can be compared directly, and it provides more detailed information than the single number provided by MOS testing. We suggest that this information could be used to advantage in algorithm development and in development of objective measures of speech quality.
doi_str_mv 10.1121/1.1397322
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_85576258</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>72231484</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343</originalsourceid><addsrcrecordid>eNqFkEtLAzEUhYMotlYX_gGZleBiam4ek8yylPqAgi50PaSZG03JPJzMFPz3TmnRpdzF5V6-c-AcQq6BzgEY3MMceK44YydkCpLRVEsmTsmUUgqpyLNsQi5i3I6n1Dw_JxOATI8SPiWvi7YN3preN3XSuKQaQu9LX2Edx48JSbQm-Poj6ZskDpst2t7vMMGdCcOvyDYllklsEe3nJTlzJkS8Ou4ZeX9YvS2f0vXL4_NysU4tF6pPwTlrlDFMM6FBCWZkpnMAgY4y1EBLBKlQCpsrIcGY0jhXKo1SCyG44DNye_Btu-ZrwNgXlY8WQzA1NkMstJQqY2Pe_0DFGAeh9453B9B2TYwduqLtfGW67wJose-5GOfQ88jeHE2HTYXlH3kslv8AO4d3WA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>72231484</pqid></control><display><type>article</type><title>Application of multidimensional scaling to subjective evaluation of coded speech</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><source>Linguistics and Language Behavior Abstracts (LLBA)</source><creator>Hall, J L</creator><creatorcontrib>Hall, J L</creatorcontrib><description>We present results from a pilot study directed at developing an anchorable subjective speech quality test. The test uses multidimensional scaling techniques to obtain quantitative information about the perceptual attributes of speech. In the first phase of the study, subjects ranked perceptual distances between samples of speech produced by two different talkers, one male and one female, processed by a variety of codecs. The resulting distance matrices were processed to obtain, for each talker, a stimulus space for the various speech samples. This stimulus space has the properties that distances between stimuli in this space correspond to perceptual distances between stimuli and that the dimensions of this space correspond to attributes used by the subjects in determining perceptual distances. Mean opinion scores (MOS) scores obtained in an earlier study were found to be highly correlated with position in the stimulus space, and the three dimensions of the stimulus space were found to have identifiable physical and perceptual correlates. In the second phase of the study, we developed techniques for fitting speech generated by a new codec under investigation into a previously established stimulus space. The user is provided with a collection of speech samples and with the stimulus space for these speech samples as determined by a large-scale listening test. The user then carries out a much smaller listening test to determine the position of the new stimulus in the previously established stimulus space. This system is anchorable, so that different versions of a codec under development can be compared directly, and it provides more detailed information than the single number provided by MOS testing. We suggest that this information could be used to advantage in algorithm development and in development of objective measures of speech quality.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.1397322</identifier><identifier>PMID: 11681393</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><publisher>United States</publisher><subject>Adult ; Aged ; Female ; Humans ; Individuality ; Judgment ; Male ; Middle Aged ; Sound Spectrography ; Speech Acoustics ; Speech Perception ; Voice Quality</subject><ispartof>The Journal of the Acoustical Society of America, 2001-10, Vol.110 (4), p.2167-2182</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343</citedby><cites>FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925,31270</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/11681393$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hall, J L</creatorcontrib><title>Application of multidimensional scaling to subjective evaluation of coded speech</title><title>The Journal of the Acoustical Society of America</title><addtitle>J Acoust Soc Am</addtitle><description>We present results from a pilot study directed at developing an anchorable subjective speech quality test. The test uses multidimensional scaling techniques to obtain quantitative information about the perceptual attributes of speech. In the first phase of the study, subjects ranked perceptual distances between samples of speech produced by two different talkers, one male and one female, processed by a variety of codecs. The resulting distance matrices were processed to obtain, for each talker, a stimulus space for the various speech samples. This stimulus space has the properties that distances between stimuli in this space correspond to perceptual distances between stimuli and that the dimensions of this space correspond to attributes used by the subjects in determining perceptual distances. Mean opinion scores (MOS) scores obtained in an earlier study were found to be highly correlated with position in the stimulus space, and the three dimensions of the stimulus space were found to have identifiable physical and perceptual correlates. In the second phase of the study, we developed techniques for fitting speech generated by a new codec under investigation into a previously established stimulus space. The user is provided with a collection of speech samples and with the stimulus space for these speech samples as determined by a large-scale listening test. The user then carries out a much smaller listening test to determine the position of the new stimulus in the previously established stimulus space. This system is anchorable, so that different versions of a codec under development can be compared directly, and it provides more detailed information than the single number provided by MOS testing. We suggest that this information could be used to advantage in algorithm development and in development of objective measures of speech quality.</description><subject>Adult</subject><subject>Aged</subject><subject>Female</subject><subject>Humans</subject><subject>Individuality</subject><subject>Judgment</subject><subject>Male</subject><subject>Middle Aged</subject><subject>Sound Spectrography</subject><subject>Speech Acoustics</subject><subject>Speech Perception</subject><subject>Voice Quality</subject><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2001</creationdate><recordtype>article</recordtype><sourceid>7T9</sourceid><recordid>eNqFkEtLAzEUhYMotlYX_gGZleBiam4ek8yylPqAgi50PaSZG03JPJzMFPz3TmnRpdzF5V6-c-AcQq6BzgEY3MMceK44YydkCpLRVEsmTsmUUgqpyLNsQi5i3I6n1Dw_JxOATI8SPiWvi7YN3preN3XSuKQaQu9LX2Edx48JSbQm-Poj6ZskDpst2t7vMMGdCcOvyDYllklsEe3nJTlzJkS8Ou4ZeX9YvS2f0vXL4_NysU4tF6pPwTlrlDFMM6FBCWZkpnMAgY4y1EBLBKlQCpsrIcGY0jhXKo1SCyG44DNye_Btu-ZrwNgXlY8WQzA1NkMstJQqY2Pe_0DFGAeh9453B9B2TYwduqLtfGW67wJose-5GOfQ88jeHE2HTYXlH3kslv8AO4d3WA</recordid><startdate>20011001</startdate><enddate>20011001</enddate><creator>Hall, J L</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>8BM</scope><scope>7T9</scope></search><sort><creationdate>20011001</creationdate><title>Application of multidimensional scaling to subjective evaluation of coded speech</title><author>Hall, J L</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2001</creationdate><topic>Adult</topic><topic>Aged</topic><topic>Female</topic><topic>Humans</topic><topic>Individuality</topic><topic>Judgment</topic><topic>Male</topic><topic>Middle Aged</topic><topic>Sound Spectrography</topic><topic>Speech Acoustics</topic><topic>Speech Perception</topic><topic>Voice Quality</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hall, J L</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>ComDisDome</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hall, J L</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of multidimensional scaling to subjective evaluation of coded speech</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><addtitle>J Acoust Soc Am</addtitle><date>2001-10-01</date><risdate>2001</risdate><volume>110</volume><issue>4</issue><spage>2167</spage><epage>2182</epage><pages>2167-2182</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>We present results from a pilot study directed at developing an anchorable subjective speech quality test. The test uses multidimensional scaling techniques to obtain quantitative information about the perceptual attributes of speech. In the first phase of the study, subjects ranked perceptual distances between samples of speech produced by two different talkers, one male and one female, processed by a variety of codecs. The resulting distance matrices were processed to obtain, for each talker, a stimulus space for the various speech samples. This stimulus space has the properties that distances between stimuli in this space correspond to perceptual distances between stimuli and that the dimensions of this space correspond to attributes used by the subjects in determining perceptual distances. Mean opinion scores (MOS) scores obtained in an earlier study were found to be highly correlated with position in the stimulus space, and the three dimensions of the stimulus space were found to have identifiable physical and perceptual correlates. In the second phase of the study, we developed techniques for fitting speech generated by a new codec under investigation into a previously established stimulus space. The user is provided with a collection of speech samples and with the stimulus space for these speech samples as determined by a large-scale listening test. The user then carries out a much smaller listening test to determine the position of the new stimulus in the previously established stimulus space. This system is anchorable, so that different versions of a codec under development can be compared directly, and it provides more detailed information than the single number provided by MOS testing. We suggest that this information could be used to advantage in algorithm development and in development of objective measures of speech quality.</abstract><cop>United States</cop><pmid>11681393</pmid><doi>10.1121/1.1397322</doi><tpages>16</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2001-10, Vol.110 (4), p.2167-2182
issn 0001-4966
1520-8524
language eng
recordid cdi_proquest_miscellaneous_85576258
source American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list); Linguistics and Language Behavior Abstracts (LLBA)
subjects Adult
Aged
Female
Humans
Individuality
Judgment
Male
Middle Aged
Sound Spectrography
Speech Acoustics
Speech Perception
Voice Quality
title Application of multidimensional scaling to subjective evaluation of coded speech
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T05%3A49%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20multidimensional%20scaling%20to%20subjective%20evaluation%20of%20coded%20speech&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Hall,%20J%20L&rft.date=2001-10-01&rft.volume=110&rft.issue=4&rft.spage=2167&rft.epage=2182&rft.pages=2167-2182&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.1397322&rft_dat=%3Cproquest_cross%3E72231484%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=72231484&rft_id=info:pmid/11681393&rfr_iscdi=true