Loading…
Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale
Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such a...
Saved in:
Published in: | Journal of signal processing systems 2016-02, Vol.82 (2), p.241-251 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73 |
---|---|
cites | cdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73 |
container_end_page | 251 |
container_issue | 2 |
container_start_page | 241 |
container_title | Journal of signal processing systems |
container_volume | 82 |
creator | Wang, Zhijian Yu, Ping Yan, Nan Wang, Lan Ng, Manwa L. |
description | Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders. |
doi_str_mv | 10.1007/s11265-015-1016-2 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1808056378</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1808056378</sourcerecordid><originalsourceid>FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</originalsourceid><addsrcrecordid>eNp9kEtPwzAQhCMEEs8fwM1HLoHdBifOMa2gIIF4lqvlOJvWKI0h6xz67zEUrpx2V_pmNDtJcopwjgDFBSNOcpkCyhQB83SykxxgmZWpQpS7fzug2k8Omd8BcigkHiShGoNfm-CsqJiJeU19EL4VjyasfOeXzppOvHlnSTyNpnNhIxbs-qW4H7vgGhd5dr6PUGX9yD9G8dqwYzE1TI3wvQgrEvPnafUiXqIdHSd7remYTn7nUbK4vnqd3aR3D_PbWXWXWokQUgNZXbSlUnlN1mYxskILNpPWmHgZSdK2tVF1mUugupGyoUJBedkoU9ZUZEfJ2db3Y_CfI3HQa8eWus70FLNqVKBA5lmhIopb1A6eeaBWfwxubYaNRtDfDettwzo2rL8b1pOomWw1HNl-SYN-9-MQn-d_RF9eqH-6</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1808056378</pqid></control><display><type>article</type><title>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</title><source>Springer Link</source><creator>Wang, Zhijian ; Yu, Ping ; Yan, Nan ; Wang, Lan ; Ng, Manwa L.</creator><creatorcontrib>Wang, Zhijian ; Yu, Ping ; Yan, Nan ; Wang, Lan ; Ng, Manwa L.</creatorcontrib><description>Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders.</description><identifier>ISSN: 1939-8018</identifier><identifier>EISSN: 1939-8115</identifier><identifier>DOI: 10.1007/s11265-015-1016-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Acoustic measurement ; Assessments ; Automation ; Circuits and Systems ; Classification ; Computer Imaging ; Electrical Engineering ; Engineering ; Image Processing and Computer Vision ; Neural networks ; Pattern Recognition ; Pattern Recognition and Graphics ; Ratings ; Signal processing ; Signal,Image and Speech Processing ; Vision ; Voice</subject><ispartof>Journal of signal processing systems, 2016-02, Vol.82 (2), p.241-251</ispartof><rights>Springer Science+Business Media New York 2015</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</citedby><cites>FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</cites><orcidid>0000-0002-1613-9301</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Wang, Zhijian</creatorcontrib><creatorcontrib>Yu, Ping</creatorcontrib><creatorcontrib>Yan, Nan</creatorcontrib><creatorcontrib>Wang, Lan</creatorcontrib><creatorcontrib>Ng, Manwa L.</creatorcontrib><title>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</title><title>Journal of signal processing systems</title><addtitle>J Sign Process Syst</addtitle><description>Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders.</description><subject>Acoustic measurement</subject><subject>Assessments</subject><subject>Automation</subject><subject>Circuits and Systems</subject><subject>Classification</subject><subject>Computer Imaging</subject><subject>Electrical Engineering</subject><subject>Engineering</subject><subject>Image Processing and Computer Vision</subject><subject>Neural networks</subject><subject>Pattern Recognition</subject><subject>Pattern Recognition and Graphics</subject><subject>Ratings</subject><subject>Signal processing</subject><subject>Signal,Image and Speech Processing</subject><subject>Vision</subject><subject>Voice</subject><issn>1939-8018</issn><issn>1939-8115</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><recordid>eNp9kEtPwzAQhCMEEs8fwM1HLoHdBifOMa2gIIF4lqvlOJvWKI0h6xz67zEUrpx2V_pmNDtJcopwjgDFBSNOcpkCyhQB83SykxxgmZWpQpS7fzug2k8Omd8BcigkHiShGoNfm-CsqJiJeU19EL4VjyasfOeXzppOvHlnSTyNpnNhIxbs-qW4H7vgGhd5dr6PUGX9yD9G8dqwYzE1TI3wvQgrEvPnafUiXqIdHSd7remYTn7nUbK4vnqd3aR3D_PbWXWXWokQUgNZXbSlUnlN1mYxskILNpPWmHgZSdK2tVF1mUugupGyoUJBedkoU9ZUZEfJ2db3Y_CfI3HQa8eWus70FLNqVKBA5lmhIopb1A6eeaBWfwxubYaNRtDfDettwzo2rL8b1pOomWw1HNl-SYN-9-MQn-d_RF9eqH-6</recordid><startdate>20160201</startdate><enddate>20160201</enddate><creator>Wang, Zhijian</creator><creator>Yu, Ping</creator><creator>Yan, Nan</creator><creator>Wang, Lan</creator><creator>Ng, Manwa L.</creator><general>Springer US</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-1613-9301</orcidid></search><sort><creationdate>20160201</creationdate><title>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</title><author>Wang, Zhijian ; Yu, Ping ; Yan, Nan ; Wang, Lan ; Ng, Manwa L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Acoustic measurement</topic><topic>Assessments</topic><topic>Automation</topic><topic>Circuits and Systems</topic><topic>Classification</topic><topic>Computer Imaging</topic><topic>Electrical Engineering</topic><topic>Engineering</topic><topic>Image Processing and Computer Vision</topic><topic>Neural networks</topic><topic>Pattern Recognition</topic><topic>Pattern Recognition and Graphics</topic><topic>Ratings</topic><topic>Signal processing</topic><topic>Signal,Image and Speech Processing</topic><topic>Vision</topic><topic>Voice</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Zhijian</creatorcontrib><creatorcontrib>Yu, Ping</creatorcontrib><creatorcontrib>Yan, Nan</creatorcontrib><creatorcontrib>Wang, Lan</creatorcontrib><creatorcontrib>Ng, Manwa L.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of signal processing systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Zhijian</au><au>Yu, Ping</au><au>Yan, Nan</au><au>Wang, Lan</au><au>Ng, Manwa L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale</atitle><jtitle>Journal of signal processing systems</jtitle><stitle>J Sign Process Syst</stitle><date>2016-02-01</date><risdate>2016</risdate><volume>82</volume><issue>2</issue><spage>241</spage><epage>251</epage><pages>241-251</pages><issn>1939-8018</issn><eissn>1939-8115</eissn><abstract>Despite the fact that perceptual evaluation is considered as a gold standard for assessing pathological voice quality, the considerably high inter- and intra-listeners variability associated with different perceptual ratings cannot be ignored. This is probably due to other confounding factors such as listeners’ perceptual bias, listeners’ experience and type of rating scale being used. Automatic objective assessment can serve as a useful tool for diagnosis of pathological voices. Acoustic analysis can be useful in determining severity of dysphonia. The present study aimed to develop a complementary automatic voice assessment system by using multidimensional acoustical measures based on the well-known GRBAS perceptual rating scale. A total of 65 dimensionality measures including traditional acoustic methods, MFCC, Glottal-to-Noise Excitation Methods and nonlinear dynamical analysis were used to compose a matrix of features. To reduce redundancy in features, four different feature extraction techniques were applied. The multiclass classification was carried out by means of RBF kernel-SVM and Extreme Learning Machine. The classification results were moderately correlated with GRBAS ratings of severity, with the best accuracy around 77.55 and 80.58 %, respectively. This suggests that such multidimensional acoustic analysis can be an appropriate assessment tool in determining the presence and severity of voice disorders.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11265-015-1016-2</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-1613-9301</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1939-8018 |
ispartof | Journal of signal processing systems, 2016-02, Vol.82 (2), p.241-251 |
issn | 1939-8018 1939-8115 |
language | eng |
recordid | cdi_proquest_miscellaneous_1808056378 |
source | Springer Link |
subjects | Acoustic measurement Assessments Automation Circuits and Systems Classification Computer Imaging Electrical Engineering Engineering Image Processing and Computer Vision Neural networks Pattern Recognition Pattern Recognition and Graphics Ratings Signal processing Signal,Image and Speech Processing Vision Voice |
title | Automatic Assessment of Pathological Voice Quality Using Multidimensional Acoustic Analysis Based on the GRBAS Scale |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T15%3A37%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20Assessment%20of%20Pathological%20Voice%20Quality%20Using%20Multidimensional%20Acoustic%20Analysis%20Based%20on%20the%20GRBAS%20Scale&rft.jtitle=Journal%20of%20signal%20processing%20systems&rft.au=Wang,%20Zhijian&rft.date=2016-02-01&rft.volume=82&rft.issue=2&rft.spage=241&rft.epage=251&rft.pages=241-251&rft.issn=1939-8018&rft.eissn=1939-8115&rft_id=info:doi/10.1007/s11265-015-1016-2&rft_dat=%3Cproquest_cross%3E1808056378%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c510t-a03b7f9886becc306081c0c35caa306a5e5cfba8b9650ebd55de78094d8a9be73%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=1808056378&rft_id=info:pmid/&rfr_iscdi=true |