Loading…

Bivariate analysis of disordered connected speech using temporal and spectral acoustic cues

The presentation concerns the assessment of disordered voices produced by dysphonic speakers. The empirical mode decomposition algorithm is used to decompose the log of the magnitude spectrum of the speech signal into its harmonic, envelope and noise components and the harmonic-to-noise ratio (HNR)...

Full description

Saved in:
Bibliographic Details
Main Authors: Kacha, A., Grenez, F., Schoentgen, J.
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The presentation concerns the assessment of disordered voices produced by dysphonic speakers. The empirical mode decomposition algorithm is used to decompose the log of the magnitude spectrum of the speech signal into its harmonic, envelope and noise components and the harmonic-to-noise ratio (HNR) is used to summarize the overall quality of the disordered voices. The present study aims at improving a previously proposed algorithm by incorporating an appropriate method that estimates automatically the thresholds required by the algorithm without knowledge of the fundamental frequency and combining the temporal acoustic marker named segmental signal-to-dysperiodicity ratio (SDRSEG) with the harmonic-to-noise ratio in order to predict the degree of perceived hoarseness. The performances of the bivariate analysis-based approach for vocal dysperiodicities assessment in terms of correlation of the predicted perceived grade scores with the original perceived degree of hoarseness are investigated using a large corpus comprising concatenations of two Dutch sentences followed by vowel [a].
ISSN:1520-6149
2379-190X
DOI:10.1109/ICASSP.2014.6853744