Loading…

Modeling the voice source in terms of spectral slopesa

A psychoacoustic model of the voice source spectrum is proposed. The model is characterized by four spectral slope parameters: the difference in amplitude between the first two harmonics (H1–H2), the second and fourth harmonics (H2–H4), the fourth harmonic and the harmonic nearest 2 kHz in frequency...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of the Acoustical Society of America 2016-03, Vol.139 (3), p.1404-1410
Main Authors: Garellek, Marc, Samlan, Robin, Gerratt, Bruce R., Kreiman, Jody
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 1410
container_issue 3
container_start_page 1404
container_title The Journal of the Acoustical Society of America
container_volume 139
creator Garellek, Marc
Samlan, Robin
Gerratt, Bruce R.
Kreiman, Jody
description A psychoacoustic model of the voice source spectrum is proposed. The model is characterized by four spectral slope parameters: the difference in amplitude between the first two harmonics (H1–H2), the second and fourth harmonics (H2–H4), the fourth harmonic and the harmonic nearest 2 kHz in frequency (H4–2 kHz), and the harmonic nearest 2 kHz and that nearest 5 kHz (2 kHz–5 kHz). As a step toward model validation, experiments were conducted to establish the acoustic and perceptual independence of these parameters. In experiment 1, the model was fit to a large number of voice sources. Results showed that parameters are predictable from one another, but that these relationships are due to overall spectral roll-off. Two additional experiments addressed the perceptual independence of the source parameters. Listener sensitivity to H1–H2, H2–H4, and H4–2 kHz did not change as a function of the slope of an adjacent component, suggesting that sensitivity to these components is robust. Listener sensitivity to changes in spectral slope from 2 kHz to 5 kHz depended on complex interactions between spectral slope, spectral noise levels, and H4–2 kHz. It is concluded that the four parameters represent non-redundant acoustic and perceptual aspects of voice quality.
doi_str_mv 10.1121/1.4944474
format article
fullrecord <record><control><sourceid>scitation</sourceid><recordid>TN_cdi_scitation_primary_10_1121_1_4944474</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>jasa</sourcerecordid><originalsourceid>FETCH-scitation_primary_10_1121_1_49444743</originalsourceid><addsrcrecordid>eNqVjr0KwjAURoMoWH8G3yCz0JrbprWdRXFxcw-hTTWSNiU3Fnx7KxTcBKfDB4ePQ8gGWAQQww4iXnDO93xCAkhjFuZpzKckYIxByIssm5MF4mOYaZ4UAckutlJGtzfq74r2VpeKon26AbqlXrkGqa0pdqr0ThqKxnYK5YrMamlQrUcuyfZ0vB7OIZbaS69tKzqnG-leApj4lAkQY1nyQ-6t-4qiq-rkr-c318lMZQ</addsrcrecordid><sourcetype>Enrichment Source</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Modeling the voice source in terms of spectral slopesa</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><creator>Garellek, Marc ; Samlan, Robin ; Gerratt, Bruce R. ; Kreiman, Jody</creator><creatorcontrib>Garellek, Marc ; Samlan, Robin ; Gerratt, Bruce R. ; Kreiman, Jody</creatorcontrib><description>A psychoacoustic model of the voice source spectrum is proposed. The model is characterized by four spectral slope parameters: the difference in amplitude between the first two harmonics (H1–H2), the second and fourth harmonics (H2–H4), the fourth harmonic and the harmonic nearest 2 kHz in frequency (H4–2 kHz), and the harmonic nearest 2 kHz and that nearest 5 kHz (2 kHz–5 kHz). As a step toward model validation, experiments were conducted to establish the acoustic and perceptual independence of these parameters. In experiment 1, the model was fit to a large number of voice sources. Results showed that parameters are predictable from one another, but that these relationships are due to overall spectral roll-off. Two additional experiments addressed the perceptual independence of the source parameters. Listener sensitivity to H1–H2, H2–H4, and H4–2 kHz did not change as a function of the slope of an adjacent component, suggesting that sensitivity to these components is robust. Listener sensitivity to changes in spectral slope from 2 kHz to 5 kHz depended on complex interactions between spectral slope, spectral noise levels, and H4–2 kHz. It is concluded that the four parameters represent non-redundant acoustic and perceptual aspects of voice quality.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.4944474</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><ispartof>The Journal of the Acoustical Society of America, 2016-03, Vol.139 (3), p.1404-1410</ispartof><rights>Acoustical Society of America</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Garellek, Marc</creatorcontrib><creatorcontrib>Samlan, Robin</creatorcontrib><creatorcontrib>Gerratt, Bruce R.</creatorcontrib><creatorcontrib>Kreiman, Jody</creatorcontrib><title>Modeling the voice source in terms of spectral slopesa</title><title>The Journal of the Acoustical Society of America</title><description>A psychoacoustic model of the voice source spectrum is proposed. The model is characterized by four spectral slope parameters: the difference in amplitude between the first two harmonics (H1–H2), the second and fourth harmonics (H2–H4), the fourth harmonic and the harmonic nearest 2 kHz in frequency (H4–2 kHz), and the harmonic nearest 2 kHz and that nearest 5 kHz (2 kHz–5 kHz). As a step toward model validation, experiments were conducted to establish the acoustic and perceptual independence of these parameters. In experiment 1, the model was fit to a large number of voice sources. Results showed that parameters are predictable from one another, but that these relationships are due to overall spectral roll-off. Two additional experiments addressed the perceptual independence of the source parameters. Listener sensitivity to H1–H2, H2–H4, and H4–2 kHz did not change as a function of the slope of an adjacent component, suggesting that sensitivity to these components is robust. Listener sensitivity to changes in spectral slope from 2 kHz to 5 kHz depended on complex interactions between spectral slope, spectral noise levels, and H4–2 kHz. It is concluded that the four parameters represent non-redundant acoustic and perceptual aspects of voice quality.</description><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid/><recordid>eNqVjr0KwjAURoMoWH8G3yCz0JrbprWdRXFxcw-hTTWSNiU3Fnx7KxTcBKfDB4ePQ8gGWAQQww4iXnDO93xCAkhjFuZpzKckYIxByIssm5MF4mOYaZ4UAckutlJGtzfq74r2VpeKon26AbqlXrkGqa0pdqr0ThqKxnYK5YrMamlQrUcuyfZ0vB7OIZbaS69tKzqnG-leApj4lAkQY1nyQ-6t-4qiq-rkr-c318lMZQ</recordid><startdate>201603</startdate><enddate>201603</enddate><creator>Garellek, Marc</creator><creator>Samlan, Robin</creator><creator>Gerratt, Bruce R.</creator><creator>Kreiman, Jody</creator><scope/></search><sort><creationdate>201603</creationdate><title>Modeling the voice source in terms of spectral slopesa</title><author>Garellek, Marc ; Samlan, Robin ; Gerratt, Bruce R. ; Kreiman, Jody</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-scitation_primary_10_1121_1_49444743</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Garellek, Marc</creatorcontrib><creatorcontrib>Samlan, Robin</creatorcontrib><creatorcontrib>Gerratt, Bruce R.</creatorcontrib><creatorcontrib>Kreiman, Jody</creatorcontrib><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Garellek, Marc</au><au>Samlan, Robin</au><au>Gerratt, Bruce R.</au><au>Kreiman, Jody</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Modeling the voice source in terms of spectral slopesa</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><date>2016-03</date><risdate>2016</risdate><volume>139</volume><issue>3</issue><spage>1404</spage><epage>1410</epage><pages>1404-1410</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>A psychoacoustic model of the voice source spectrum is proposed. The model is characterized by four spectral slope parameters: the difference in amplitude between the first two harmonics (H1–H2), the second and fourth harmonics (H2–H4), the fourth harmonic and the harmonic nearest 2 kHz in frequency (H4–2 kHz), and the harmonic nearest 2 kHz and that nearest 5 kHz (2 kHz–5 kHz). As a step toward model validation, experiments were conducted to establish the acoustic and perceptual independence of these parameters. In experiment 1, the model was fit to a large number of voice sources. Results showed that parameters are predictable from one another, but that these relationships are due to overall spectral roll-off. Two additional experiments addressed the perceptual independence of the source parameters. Listener sensitivity to H1–H2, H2–H4, and H4–2 kHz did not change as a function of the slope of an adjacent component, suggesting that sensitivity to these components is robust. Listener sensitivity to changes in spectral slope from 2 kHz to 5 kHz depended on complex interactions between spectral slope, spectral noise levels, and H4–2 kHz. It is concluded that the four parameters represent non-redundant acoustic and perceptual aspects of voice quality.</abstract><doi>10.1121/1.4944474</doi><tpages>7</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2016-03, Vol.139 (3), p.1404-1410
issn 0001-4966
1520-8524
language eng
recordid cdi_scitation_primary_10_1121_1_4944474
source American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)
title Modeling the voice source in terms of spectral slopesa
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T10%3A06%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-scitation&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Modeling%20the%20voice%20source%20in%20terms%20of%20spectral%20slopesa&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Garellek,%20Marc&rft.date=2016-03&rft.volume=139&rft.issue=3&rft.spage=1404&rft.epage=1410&rft.pages=1404-1410&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.4944474&rft_dat=%3Cscitation%3Ejasa%3C/scitation%3E%3Cgrp_id%3Ecdi_FETCH-scitation_primary_10_1121_1_49444743%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true