Loading…

Perceptual importance of time-domain features of the voice source

Our previous study examined the perceptual adequacy of different source models. We found that perceived similarity between modeled and natural voice samples was best predicted (in the time dimension) by thematch between waveforms at the negative peak of the flow derivative (R2 = 0.34). The extent of...

Full description

Saved in:
Bibliographic Details
Published in:The Journal of the Acoustical Society of America 2014-04, Vol.135 (4_Supplement), p.2422-2422
Main Authors: Garellek, Marc, Chen, Gang, Gerratt, Bruce R., Alwan, Abeer, Kreiman, Jody E.
Format: Article
Language:English
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Our previous study examined the perceptual adequacy of different source models. We found that perceived similarity between modeled and natural voice samples was best predicted (in the time dimension) by thematch between waveforms at the negative peak of the flow derivative (R2 = 0.34). The extent of fit during the opening phase of the source pulses added only 2% to perceived match. However, in that study model, fitting was unweighted, and results might differ if another approach were used. In this study, we constrained the models to fit the negative peak of the flow derivative precisely. We fit 6 different source models to 40 natural voice sources, and then generated synthetic copies of the voices using each modeled source pulse, with all other synthesizer parameters held constant. We then conducted a visual sort-and-rate task in which listeners assessed the extent of perceived match between the original natural voice samples and each copy. Discussion will focus on the specific strengths and weaknesses of each modeling approach for characterizing differences in vocal quality, and on the importance of matches to specific time-domain events versus spectral features in determining voice quality. [Work supported by NIH/NIDCD grant DC01797 and NSF grant IIS-1018863.]
ISSN:0001-4966
1520-8524
DOI:10.1121/1.4878046