Loading…
Assessment of disordered voice via the first rahmonic
► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first ra...
Saved in:
Published in: | Speech communication 2012-06, Vol.54 (5), p.655-663 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993 |
---|---|
cites | cdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993 |
container_end_page | 663 |
container_issue | 5 |
container_start_page | 655 |
container_title | Speech communication |
container_volume | 54 |
creator | Alpan, A. Schoentgen, J. Maryn, Y. Grenez, F. Murphy, P. |
description | ► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence.
A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP). |
doi_str_mv | 10.1016/j.specom.2011.04.001 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1536169670</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639311000628</els_id><sourcerecordid>1536169670</sourcerecordid><originalsourceid>FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</originalsourceid><addsrcrecordid>eNqF0D1PwzAQgGELgUQp_AOGjCwJPttx7AWpqviSKrHAbBnnorpq4uJLK_HvCSoznW557k56GbsFXgEHfb-paIch9ZXgABVXFedwxmZgGlE2YMQ5m02sKbW08pJdEW0458oYMWP1ggiJehzGInVFGynlFjO2xSHFgMUh-mJcY9HFTGOR_bpPQwzX7KLzW8KbvzlnH0-P78uXcvX2_LpcrMqgeD2WLVcWfdNwqVWwKHUNxhtp62A-lZKtCLIzoYbOCB0MKi-0DAJBeailsVbO2d3x7i6nrz3S6PpIAbdbP2Dak5uYBm319OEkFQKUtNY0pykXwhiQoCeqjjTkRJSxc7sce5-_J-R-27uNO7Z3v-0dV25qP609HNdwinOImB2FiEPANmYMo2tT_P_ADyDZjEA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1022881316</pqid></control><display><type>article</type><title>Assessment of disordered voice via the first rahmonic</title><source>ScienceDirect Freedom Collection 2022-2024</source><source>Linguistics and Language Behavior Abstracts (LLBA)</source><creator>Alpan, A. ; Schoentgen, J. ; Maryn, Y. ; Grenez, F. ; Murphy, P.</creator><creatorcontrib>Alpan, A. ; Schoentgen, J. ; Maryn, Y. ; Grenez, F. ; Murphy, P.</creatorcontrib><description>► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence.
A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2011.04.001</identifier><identifier>CODEN: SCOMDH</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Amplitudes ; Band spectra ; Cepstrum ; Computation ; Connected speech ; Correlation ; Correlation analysis ; Disordered voice analysis ; First rahmonic ; Ratings ; Spectra ; Speech ; Sustained vowel ; Voice</subject><ispartof>Speech communication, 2012-06, Vol.54 (5), p.655-663</ispartof><rights>2011 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</citedby><cites>FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925,31270</link.rule.ids></links><search><creatorcontrib>Alpan, A.</creatorcontrib><creatorcontrib>Schoentgen, J.</creatorcontrib><creatorcontrib>Maryn, Y.</creatorcontrib><creatorcontrib>Grenez, F.</creatorcontrib><creatorcontrib>Murphy, P.</creatorcontrib><title>Assessment of disordered voice via the first rahmonic</title><title>Speech communication</title><description>► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence.
A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).</description><subject>Amplitudes</subject><subject>Band spectra</subject><subject>Cepstrum</subject><subject>Computation</subject><subject>Connected speech</subject><subject>Correlation</subject><subject>Correlation analysis</subject><subject>Disordered voice analysis</subject><subject>First rahmonic</subject><subject>Ratings</subject><subject>Spectra</subject><subject>Speech</subject><subject>Sustained vowel</subject><subject>Voice</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>7T9</sourceid><recordid>eNqF0D1PwzAQgGELgUQp_AOGjCwJPttx7AWpqviSKrHAbBnnorpq4uJLK_HvCSoznW557k56GbsFXgEHfb-paIch9ZXgABVXFedwxmZgGlE2YMQ5m02sKbW08pJdEW0458oYMWP1ggiJehzGInVFGynlFjO2xSHFgMUh-mJcY9HFTGOR_bpPQwzX7KLzW8KbvzlnH0-P78uXcvX2_LpcrMqgeD2WLVcWfdNwqVWwKHUNxhtp62A-lZKtCLIzoYbOCB0MKi-0DAJBeailsVbO2d3x7i6nrz3S6PpIAbdbP2Dak5uYBm319OEkFQKUtNY0pykXwhiQoCeqjjTkRJSxc7sce5-_J-R-27uNO7Z3v-0dV25qP609HNdwinOImB2FiEPANmYMo2tT_P_ADyDZjEA</recordid><startdate>201206</startdate><enddate>201206</enddate><creator>Alpan, A.</creator><creator>Schoentgen, J.</creator><creator>Maryn, Y.</creator><creator>Grenez, F.</creator><creator>Murphy, P.</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7T9</scope><scope>8BM</scope></search><sort><creationdate>201206</creationdate><title>Assessment of disordered voice via the first rahmonic</title><author>Alpan, A. ; Schoentgen, J. ; Maryn, Y. ; Grenez, F. ; Murphy, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Amplitudes</topic><topic>Band spectra</topic><topic>Cepstrum</topic><topic>Computation</topic><topic>Connected speech</topic><topic>Correlation</topic><topic>Correlation analysis</topic><topic>Disordered voice analysis</topic><topic>First rahmonic</topic><topic>Ratings</topic><topic>Spectra</topic><topic>Speech</topic><topic>Sustained vowel</topic><topic>Voice</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Alpan, A.</creatorcontrib><creatorcontrib>Schoentgen, J.</creatorcontrib><creatorcontrib>Maryn, Y.</creatorcontrib><creatorcontrib>Grenez, F.</creatorcontrib><creatorcontrib>Murphy, P.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>ComDisDome</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Alpan, A.</au><au>Schoentgen, J.</au><au>Maryn, Y.</au><au>Grenez, F.</au><au>Murphy, P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessment of disordered voice via the first rahmonic</atitle><jtitle>Speech communication</jtitle><date>2012-06</date><risdate>2012</risdate><volume>54</volume><issue>5</issue><spage>655</spage><epage>663</epage><pages>655-663</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><coden>SCOMDH</coden><abstract>► The amplitude of the first rahmonic peak obtained for connected speech and sustained vowels. ► The amplitude of the first rahmonic peak correlates with perceived hoarseness. ► Period-synchronous and harmonic-limited analyses increase correlation. ► Comparisons between the amplitude of the first rahmonic peak and cepstral peak prominence.
A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2011.04.001</doi><tpages>9</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0167-6393 |
ispartof | Speech communication, 2012-06, Vol.54 (5), p.655-663 |
issn | 0167-6393 1872-7182 |
language | eng |
recordid | cdi_proquest_miscellaneous_1536169670 |
source | ScienceDirect Freedom Collection 2022-2024; Linguistics and Language Behavior Abstracts (LLBA) |
subjects | Amplitudes Band spectra Cepstrum Computation Connected speech Correlation Correlation analysis Disordered voice analysis First rahmonic Ratings Spectra Speech Sustained vowel Voice |
title | Assessment of disordered voice via the first rahmonic |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T12%3A34%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessment%20of%20disordered%20voice%20via%20the%20first%20rahmonic&rft.jtitle=Speech%20communication&rft.au=Alpan,%20A.&rft.date=2012-06&rft.volume=54&rft.issue=5&rft.spage=655&rft.epage=663&rft.pages=655-663&rft.issn=0167-6393&rft.eissn=1872-7182&rft.coden=SCOMDH&rft_id=info:doi/10.1016/j.specom.2011.04.001&rft_dat=%3Cproquest_cross%3E1536169670%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c405t-d049ea770364c9e36518a8395c8b443d2c3f8c51f826c8e4a263c2e14a1538993%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1022881316&rft_id=info:pmid/&rfr_iscdi=true |