Loading…
Age, sex, and vowel dependencies of acoustic measures related to the voice source
The effects of age, sex, and vocal tract configuration on the glottal excitation signal in speech are only partially understood, yet understanding these effects is important for both recognition and synthesis of speech as well as for medical purposes. In this paper, three acoustic measures related t...
Saved in:
Published in: | The Journal of the Acoustical Society of America 2007-04, Vol.121 (4), p.2283-2295 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The effects of age, sex, and vocal tract configuration on the glottal excitation signal in speech are only partially understood, yet understanding these effects is important for both recognition and synthesis of speech as well as for medical purposes. In this paper, three acoustic measures related to the voice source are analyzed for five vowels from 3145 CVC utterances spoken by 335 talkers (
8
-
39
years
old) from the CID database [
Miller
,
Proceedings of ICASSP
,
1996
, Vol.
2
, pp.
849-852
]. The measures are: the fundamental frequency
(
F
0
)
, the difference between the "corrected" (denoted by an asterisk) first two spectral harmonic magnitudes,
H
1
*
−
H
2
*
(related to the open quotient), and the difference between the "corrected" magnitudes of the first spectral harmonic and that of the third formant peak,
H
1
*
−
A
3
*
(related to source spectral tilt). The correction refers to compensating for the influence of formant frequencies on spectral magnitude estimation. Experimental results show that the three acoustic measures are dependent to varying degrees on age and vowel. Age dependencies are more prominent for male talkers, while vowel dependencies are more prominent for female talkers suggesting a greater vocal tract-source interaction. All talkers show a dependency of
F
0
on sex and on
F
3
, and of
H
1
*
−
A
3
*
on vowel type. For low-pitched talkers
(
F
0
⩽
175
Hz
)
,
H
1
*
−
H
2
*
is positively correlated with
F
0
while for high-pitched talkers,
H
1
*
−
H
2
*
is dependent on
F
1
or vowel height. For high-pitched talkers there were no significant sex dependencies of
H
1
*
−
H
2
*
and
H
1
*
−
A
3
*
. The statistical significance of these results is shown. |
---|---|
ISSN: | 0001-4966 1520-8524 |
DOI: | 10.1121/1.2697522 |