Loading…

Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale

Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such a...

Full description

Saved in:
Bibliographic Details
Published in:Journal of signal processing systems 2016-02, Vol.82 (2), p.241-251
Main Authors: Wang, Zhijian, Yu, Ping, Yan, Nan, Wang, Lan, Ng, Manwa L.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73
cites cdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73
container_end_page 251
container_issue 2
container_start_page 241
container_title Journal of signal processing systems
container_volume 82
creator Wang, Zhijian
Yu, Ping
Yan, Nan
Wang, Lan
Ng, Manwa L.
description Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders.
doi_str_mv 10.1007/s11265-015-1016-2
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1808056378</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1808056378</sourcerecordid><originalsourceid>FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</originalsourceid><addsrcrecordid>eNp9kEtPwzAQhCMEEs8fwM1HLoHdBifOMa2gIIF4lqvlOJvWKI0h6xz67zEUrpx2V_pmNDtJcopwjgDFBSNOcpkCyhQB83SykxxgmZWpQpS7fzug2k8Omd8BcigkHiShGoNfm-CsqJiJeU19EL4VjyasfOeXzppOvHlnSTyNpnNhIxbs-qW4H7vgGhd5dr6PUGX9yD9G8dqwYzE1TI3wvQgrEvPnafUiXqIdHSd7remYTn7nUbK4vnqd3aR3D_PbWXWXWokQUgNZXbSlUnlN1mYxskILNpPWmHgZSdK2tVF1mUugupGyoUJBedkoU9ZUZEfJ2db3Y_CfI3HQa8eWus70FLNqVKBA5lmhIopb1A6eeaBWfwxubYaNRtDfDettwzo2rL8b1pOomWw1HNl-SYN-9-MQn-d_RF9eqH-6</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1808056378</pqid></control><display><type>article</type><title>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</title><source>Springer Link</source><creator>Wang, Zhijian ; Yu, Ping ; Yan, Nan ; Wang, Lan ; Ng, Manwa L.</creator><creatorcontrib>Wang, Zhijian ; Yu, Ping ; Yan, Nan ; Wang, Lan ; Ng, Manwa L.</creatorcontrib><description>Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders.</description><identifier>ISSN: 1939-8018</identifier><identifier>EISSN: 1939-8115</identifier><identifier>DOI: 10.1007/s11265-015-1016-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Acoustic measurement ; Assessments ; Automation ; Circuits and Systems ; Classification ; Computer Imaging ; Electrical Engineering ; Engineering ; Image Processing and Computer Vision ; Neural networks ; Pattern Recognition ; Pattern Recognition and Graphics ; Ratings ; Signal processing ; Signal,Image and Speech Processing ; Vision ; Voice</subject><ispartof>Journal of signal processing systems, 2016-02, Vol.82 (2), p.241-251</ispartof><rights>Springer Science+Business Media New York 2015</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</citedby><cites>FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</cites><orcidid>0000-0002-1613-9301</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Wang, Zhijian</creatorcontrib><creatorcontrib>Yu, Ping</creatorcontrib><creatorcontrib>Yan, Nan</creatorcontrib><creatorcontrib>Wang, Lan</creatorcontrib><creatorcontrib>Ng, Manwa L.</creatorcontrib><title>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</title><title>Journal of signal processing systems</title><addtitle>J Sign Process Syst</addtitle><description>Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders.</description><subject>Acoustic measurement</subject><subject>Assessments</subject><subject>Automation</subject><subject>Circuits and Systems</subject><subject>Classification</subject><subject>Computer Imaging</subject><subject>Electrical Engineering</subject><subject>Engineering</subject><subject>Image Processing and Computer Vision</subject><subject>Neural networks</subject><subject>Pattern Recognition</subject><subject>Pattern Recognition and Graphics</subject><subject>Ratings</subject><subject>Signal processing</subject><subject>Signal,Image and Speech Processing</subject><subject>Vision</subject><subject>Voice</subject><issn>1939-8018</issn><issn>1939-8115</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNp9kEtPwzAQhCMEEs8fwM1HLoHdBifOMa2gIIF4lqvlOJvWKI0h6xz67zEUrpx2V_pmNDtJcopwjgDFBSNOcpkCyhQB83SykxxgmZWpQpS7fzug2k8Omd8BcigkHiShGoNfm-CsqJiJeU19EL4VjyasfOeXzppOvHlnSTyNpnNhIxbs-qW4H7vgGhd5dr6PUGX9yD9G8dqwYzE1TI3wvQgrEvPnafUiXqIdHSd7remYTn7nUbK4vnqd3aR3D_PbWXWXWokQUgNZXbSlUnlN1mYxskILNpPWmHgZSdK2tVF1mUugupGyoUJBedkoU9ZUZEfJ2db3Y_CfI3HQa8eWus70FLNqVKBA5lmhIopb1A6eeaBWfwxubYaNRtDfDettwzo2rL8b1pOomWw1HNl-SYN-9-MQn-d_RF9eqH-6</recordid><startdate>20160201</startdate><enddate>20160201</enddate><creator>Wang, Zhijian</creator><creator>Yu, Ping</creator><creator>Yan, Nan</creator><creator>Wang, Lan</creator><creator>Ng, Manwa L.</creator><general>Springer US</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-1613-9301</orcidid></search><sort><creationdate>20160201</creationdate><title>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</title><author>Wang, Zhijian ; Yu, Ping ; Yan, Nan ; Wang, Lan ; Ng, Manwa L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Acoustic measurement</topic><topic>Assessments</topic><topic>Automation</topic><topic>Circuits and Systems</topic><topic>Classification</topic><topic>Computer Imaging</topic><topic>Electrical Engineering</topic><topic>Engineering</topic><topic>Image Processing and Computer Vision</topic><topic>Neural networks</topic><topic>Pattern Recognition</topic><topic>Pattern Recognition and Graphics</topic><topic>Ratings</topic><topic>Signal processing</topic><topic>Signal,Image and Speech Processing</topic><topic>Vision</topic><topic>Voice</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Zhijian</creatorcontrib><creatorcontrib>Yu, Ping</creatorcontrib><creatorcontrib>Yan, Nan</creatorcontrib><creatorcontrib>Wang, Lan</creatorcontrib><creatorcontrib>Ng, Manwa L.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of signal processing systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Zhijian</au><au>Yu, Ping</au><au>Yan, Nan</au><au>Wang, Lan</au><au>Ng, Manwa L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</atitle><jtitle>Journal of signal processing systems</jtitle><stitle>J Sign Process Syst</stitle><date>2016-02-01</date><risdate>2016</risdate><volume>82</volume><issue>2</issue><spage>241</spage><epage>251</epage><pages>241-251</pages><issn>1939-8018</issn><eissn>1939-8115</eissn><abstract>Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11265-015-1016-2</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-1613-9301</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1939-8018
ispartof Journal of signal processing systems, 2016-02, Vol.82 (2), p.241-251
issn 1939-8018
1939-8115
language eng
recordid cdi_proquest_miscellaneous_1808056378
source Springer Link
subjects Acoustic measurement
Assessments
Automation
Circuits and Systems
Classification
Computer Imaging
Electrical Engineering
Engineering
Image Processing and Computer Vision
Neural networks
Pattern Recognition
Pattern Recognition and Graphics
Ratings
Signal processing
Signal,Image and Speech Processing
Vision
Voice
title Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T15%3A37%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20Assessment%20of%20Pathological%20Voice%20Quality%20Using%20Multidimensional%20Acoustic%20Analysis%20Based%20on%20the%20GRBAS%20Scale&rft.jtitle=Journal%20of%20signal%20processing%20systems&rft.au=Wang,%20Zhijian&rft.date=2016-02-01&rft.volume=82&rft.issue=2&rft.spage=241&rft.epage=251&rft.pages=241-251&rft.issn=1939-8018&rft.eissn=1939-8115&rft_id=info:doi/10.1007/s11265-015-1016-2&rft_dat=%3Cproquest_cross%3E1808056378%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1808056378&rft_id=info:pmid/&rfr_iscdi=true