Assessment of disordered voice via the first rahmonic

► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first ra...

Bibliographic Details
Published in: Speech communication 2012-06, Vol.54 (5), p.655-663
Main Authors: Alpan, A., Schoentgen, J., Maryn, Y., Grenez, F., Murphy, P.
Format: Article
Language: English
cited_by cdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993
cites cdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993
container_end_page 663
container_issue 5
container_start_page 655
container_title Speech communication
container_volume 54
creator Alpan, A.
Schoentgen, J.
Maryn, Y.
Grenez, F.
Murphy, P.
description ► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence. A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).
doi_str_mv 10.1016/j.specom.2011.04.001
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1536169670</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639311000628</els_id><sourcerecordid>1536169670</sourcerecordid><originalsourceid>FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</originalsourceid><addsrcrecordid>eNqF0D1PwzAQgGELgUQp_AOGjCwJPttx7AWpqviSKrHAbBnnorpq4uJLK_HvCSoznW557k56GbsFXgEHfb-paIch9ZXgABVXFedwxmZgGlE2YMQ5m02sKbW08pJdEW0458oYMWP1ggiJehzGInVFGynlFjO2xSHFgMUh-mJcY9HFTGOR_bpPQwzX7KLzW8KbvzlnH0-P78uXcvX2_LpcrMqgeD2WLVcWfdNwqVWwKHUNxhtp62A-lZKtCLIzoYbOCB0MKi-0DAJBeailsVbO2d3x7i6nrz3S6PpIAbdbP2Dak5uYBm319OEkFQKUtNY0pykXwhiQoCeqjjTkRJSxc7sce5-_J-R-27uNO7Z3v-0dV25qP609HNdwinOImB2FiEPANmYMo2tT_P_ADyDZjEA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1022881316</pqid></control><display><type>article</type><title>Assessment of disordered voice via the first rahmonic</title><source>ScienceDirect Freedom Collection 2022-2024</source><source>Linguistics and Language Behavior Abstracts (LLBA)</source><creator>Alpan, A. ; Schoentgen, J. ; Maryn, Y. ; Grenez, F. ; Murphy, P.</creator><creatorcontrib>Alpan, A. ; Schoentgen, J. ; Maryn, Y. ; Grenez, F. ; Murphy, P.</creatorcontrib><description>► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence. A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. 
In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2011.04.001</identifier><identifier>CODEN: SCOMDH</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Amplitudes ; Band spectra ; Cepstrum ; Computation ; Connected speech ; Correlation ; Correlation analysis ; Disordered voice analysis ; First rahmonic ; Ratings ; Spectra ; Speech ; Sustained vowel ; Voice</subject><ispartof>Speech communication, 2012-06, Vol.54 (5), p.655-663</ispartof><rights>2011 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</citedby><cites>FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925,31270</link.rule.ids></links><search><creatorcontrib>Alpan, A.</creatorcontrib><creatorcontrib>Schoentgen, J.</creatorcontrib><creatorcontrib>Maryn, 
Y.</creatorcontrib><creatorcontrib>Grenez, F.</creatorcontrib><creatorcontrib>Murphy, P.</creatorcontrib><title>Assessment of disordered voice via the first rahmonic</title><title>Speech communication</title><description>► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence. A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. 
In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).</description><subject>Amplitudes</subject><subject>Band spectra</subject><subject>Cepstrum</subject><subject>Computation</subject><subject>Connected speech</subject><subject>Correlation</subject><subject>Correlation analysis</subject><subject>Disordered voice analysis</subject><subject>First rahmonic</subject><subject>Ratings</subject><subject>Spectra</subject><subject>Speech</subject><subject>Sustained vowel</subject><subject>Voice</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>7T9</sourceid><recordid>eNqF0D1PwzAQgGELgUQp_AOGjCwJPttx7AWpqviSKrHAbBnnorpq4uJLK_HvCSoznW557k56GbsFXgEHfb-paIch9ZXgABVXFedwxmZgGlE2YMQ5m02sKbW08pJdEW0458oYMWP1ggiJehzGInVFGynlFjO2xSHFgMUh-mJcY9HFTGOR_bpPQwzX7KLzW8KbvzlnH0-P78uXcvX2_LpcrMqgeD2WLVcWfdNwqVWwKHUNxhtp62A-lZKtCLIzoYbOCB0MKi-0DAJBeailsVbO2d3x7i6nrz3S6PpIAbdbP2Dak5uYBm319OEkFQKUtNY0pykXwhiQoCeqjjTkRJSxc7sce5-_J-R-27uNO7Z3v-0dV25qP609HNdwinOImB2FiEPANmYMo2tT_P_ADyDZjEA</recordid><startdate>201206</startdate><enddate>201206</enddate><creator>Alpan, A.</creator><creator>Schoentgen, J.</creator><creator>Maryn, Y.</creator><creator>Grenez, F.</creator><creator>Murphy, P.</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7T9</scope><scope>8BM</scope></search><sort><creationdate>201206</creationdate><title>Assessment of disordered voice via the first rahmonic</title><author>Alpan, A. ; Schoentgen, J. ; Maryn, Y. ; Grenez, F. 
; Murphy, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Amplitudes</topic><topic>Band spectra</topic><topic>Cepstrum</topic><topic>Computation</topic><topic>Connected speech</topic><topic>Correlation</topic><topic>Correlation analysis</topic><topic>Disordered voice analysis</topic><topic>First rahmonic</topic><topic>Ratings</topic><topic>Spectra</topic><topic>Speech</topic><topic>Sustained vowel</topic><topic>Voice</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Alpan, A.</creatorcontrib><creatorcontrib>Schoentgen, J.</creatorcontrib><creatorcontrib>Maryn, Y.</creatorcontrib><creatorcontrib>Grenez, F.</creatorcontrib><creatorcontrib>Murphy, P.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>ComDisDome</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Alpan, A.</au><au>Schoentgen, J.</au><au>Maryn, Y.</au><au>Grenez, F.</au><au>Murphy, P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessment of disordered voice via the first rahmonic</atitle><jtitle>Speech 
communication</jtitle><date>2012-06</date><risdate>2012</risdate><volume>54</volume><issue>5</issue><spage>655</spage><epage>663</epage><pages>655-663</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><coden>SCOMDH</coden><abstract>► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence. A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2011.04.001</doi><tpages>9</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0167-6393
ispartof Speech communication, 2012-06, Vol.54 (5), p.655-663
issn 0167-6393
1872-7182
language eng
recordid cdi_proquest_miscellaneous_1536169670
source ScienceDirect Freedom Collection 2022-2024; Linguistics and Language Behavior Abstracts (LLBA)
subjects Amplitudes
Band spectra
Cepstrum
Computation
Connected speech
Correlation
Correlation analysis
Disordered voice analysis
First rahmonic
Ratings
Spectra
Speech
Sustained vowel
Voice
title Assessment of disordered voice via the first rahmonic
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T12%3A34%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessment%20of%20disordered%20voice%20via%20the%20first%20rahmonic&rft.jtitle=Speech%20communication&rft.au=Alpan,%20A.&rft.date=2012-06&rft.volume=54&rft.issue=5&rft.spage=655&rft.epage=663&rft.pages=655-663&rft.issn=0167-6393&rft.eissn=1872-7182&rft.coden=SCOMDH&rft_id=info:doi/10.1016/j.specom.2011.04.001&rft_dat=%3Cproquest_cross%3E1536169670%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1022881316&rft_id=info:pmid/&rfr_iscdi=true
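
The description field above states that the cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum, and that R1 is the amplitude of the first rahmonic peak. The following is a minimal Python sketch of that definition only, not the authors' implementation. The Hann window, the 60-400 Hz search range, the small constant added before the logarithm, and the function names real_cepstrum and first_rahmonic are all assumptions made for illustration. The period-synchronous and band-limiting pre-processing steps studied in the article are not reproduced here.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 FFT. len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def ifft(spectrum):
    """Inverse FFT computed via the conjugate trick on top of fft()."""
    n = len(spectrum)
    conj = fft([complex(v).conjugate() for v in spectrum])
    return [v.conjugate() / n for v in conj]


def real_cepstrum(frame):
    """Inverse Fourier transform of the log-magnitude spectrum of one frame.

    Assumption: a Hann window is applied first; the record does not say
    which window (if any) the article uses.
    """
    n = len(frame)
    windowed = [s * (0.5 - 0.5 * math.cos(2.0 * math.pi * i / n))
                for i, s in enumerate(frame)]
    # Small constant avoids log(0); its value is an arbitrary choice.
    log_mag = [math.log(abs(v) + 1e-12) for v in fft(windowed)]
    return [v.real for v in ifft(log_mag)]


def first_rahmonic(cepstrum, fs, f0_min=60.0, f0_max=400.0):
    """Return (quefrency index, amplitude) of the largest cepstral peak
    in the quefrency range matching f0_min..f0_max (assumed range)."""
    q_lo = int(fs / f0_max)
    q_hi = int(fs / f0_min)
    q = max(range(q_lo, q_hi + 1), key=lambda i: cepstrum[i])
    return q, cepstrum[q]


if __name__ == "__main__":
    fs = 8000.0
    # Synthetic test frame: an impulse train with a period of 80 samples
    # (100 Hz at fs = 8000 Hz), standing in for a voiced signal.
    frame = [1.0 if i % 80 == 0 else 0.0 for i in range(1024)]
    q, r1 = first_rahmonic(real_cepstrum(frame), fs)
    print("first rahmonic at quefrency", q / fs, "s, amplitude", r1)
```

The peak search is restricted to quefrencies that correspond to plausible fundamental frequencies so that the low-quefrency part of the cepstrum, which reflects the spectral envelope, is not mistaken for the first rahmonic.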