Application of multidimensional scaling to subjective evaluation of coded speech
We present results from a pilot study directed at developing an anchorable subjective speech quality test. The test uses multidimensional scaling techniques to obtain quantitative information about the perceptual attributes of speech. In the first phase of the study, subjects ranked perceptual dista...
Published in: | The Journal of the Acoustical Society of America 2001-10, Vol.110 (4), p.2167-2182 |
---|---|
Main Author: | Hall, J L |
Format: | Article |
Language: | English |
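Subjects: | Adult; Aged; Female; Humans; Individuality; Judgment; Male; Middle Aged; Sound Spectrography; Speech Acoustics; Speech Perception; Voice Quality |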
cited_by | cdi_FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343 |
---|---|
cites | cdi_FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343 |
container_end_page | 2182 |
container_issue | 4 |
container_start_page | 2167 |
container_title | The Journal of the Acoustical Society of America |
container_volume | 110 |
creator | Hall, J L |
description | We present results from a pilot study directed at developing an anchorable subjective speech quality test. The test uses multidimensional scaling techniques to obtain quantitative information about the perceptual attributes of speech. In the first phase of the study, subjects ranked perceptual distances between samples of speech produced by two different talkers, one male and one female, processed by a variety of codecs. The resulting distance matrices were processed to obtain, for each talker, a stimulus space for the various speech samples. This stimulus space has two properties: distances between stimuli in the space correspond to perceptual distances between stimuli, and the dimensions of the space correspond to attributes used by the subjects in determining perceptual distances. Mean opinion scores (MOS) obtained in an earlier study were found to be highly correlated with position in the stimulus space, and the three dimensions of the stimulus space were found to have identifiable physical and perceptual correlates. In the second phase of the study, we developed techniques for fitting speech generated by a new codec under investigation into a previously established stimulus space. The user is provided with a collection of speech samples and with the stimulus space for these speech samples as determined by a large-scale listening test. The user then carries out a much smaller listening test to determine the position of the new stimulus in the previously established stimulus space. This system is anchorable, so that different versions of a codec under development can be compared directly, and it provides more detailed information than the single number provided by MOS testing. We suggest that this information could be used to advantage in algorithm development and in the development of objective measures of speech quality. |
doi_str_mv | 10.1121/1.1397322 |
format | article |
fulltext | fulltext |
identifier | ISSN: 0001-4966 |
ispartof | The Journal of the Acoustical Society of America, 2001-10, Vol.110 (4), p.2167-2182 |
issn | 0001-4966; 1520-8524 |
language | eng |
recordid | cdi_proquest_miscellaneous_85576258 |
source | American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list); Linguistics and Language Behavior Abstracts (LLBA) |
subjects | Adult; Aged; Female; Humans; Individuality; Judgment; Male; Middle Aged; Sound Spectrography; Speech Acoustics; Speech Perception; Voice Quality |
title | Application of multidimensional scaling to subjective evaluation of coded speech |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T05%3A49%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20multidimensional%20scaling%20to%20subjective%20evaluation%20of%20coded%20speech&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Hall,%20J%20L&rft.date=2001-10-01&rft.volume=110&rft.issue=4&rft.spage=2167&rft.epage=2182&rft.pages=2167-2182&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.1397322&rft_dat=%3Cproquest_cross%3E72231484%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c347t-1ffca7aa282481742a5689114ef02e810de157e54c97451aadaffd78e58444343%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=72231484&rft_id=info:pmid/11681393&rfr_iscdi=true |
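
The second phase described in the abstract, placing a new stimulus in a previously established stimulus space, can be illustrated with a small sketch. This is not the author's procedure: the study worked from ranked perceptual distances, whereas the sketch assumes numeric distances from the new stimulus to a few anchor stimuli are available and uses ordinary linearized least squares. The function names `solve_linear` and `place_new_stimulus`, the anchor coordinates, and the test position are all hypothetical values chosen for illustration.

```python
import math


def solve_linear(matrix, vector):
    """Solve matrix @ x = vector by Gaussian elimination with partial pivoting."""
    n = len(vector)
    aug = [list(matrix[i]) + [vector[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("anchors do not span the stimulus space")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        known = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - known) / aug[i][i]
    return x


def place_new_stimulus(anchors, dists):
    """Estimate coordinates of a new point from its distances to anchor points.

    Each anchor a_i gives |x - a_i|^2 = d_i^2. Subtracting the equation for
    the first anchor removes |x|^2 and leaves a linear system in x:
        2 (a_i - a_0) . x = |a_i|^2 - |a_0|^2 - d_i^2 + d_0^2
    which is solved in the least-squares sense via the normal equations.
    """
    a0, d0 = anchors[0], dists[0]
    dim = len(a0)

    def sq(point):
        return sum(c * c for c in point)

    rows = [[2.0 * (a[k] - a0[k]) for k in range(dim)] for a in anchors[1:]]
    rhs = [sq(a) - sq(a0) - d * d + d0 * d0
           for a, d in zip(anchors[1:], dists[1:])]
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(dim)]
           for i in range(dim)]
    atb = [sum(r[i] * b for r, b in zip(rows, rhs)) for i in range(dim)]
    return solve_linear(ata, atb)


# Hypothetical three-dimensional stimulus space with five anchor stimuli.
anchors = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0),
           (0.0, 0.0, 1.0), (1.0, 1.0, 1.0)]
true_position = (0.2, 0.5, 0.7)  # made-up location of the "new codec"
dists = [math.dist(true_position, a) for a in anchors]
estimate = place_new_stimulus(anchors, dists)
```

With exact distances the estimate recovers the assumed position up to rounding error; with noisy or purely ordinal judgments, as in a real listening test, a nonmetric fitting method would be the more appropriate choice.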