Loading…
Automatic speaker independent dysarthric speech intelligibility assessment system
•A speaker independent dysarthria speech assessment system for automatic speech intelligibility scoring system.•Assessment technique exploits the raw output of an end-to-end speech to alphabet recognition engine.•Predict’s intelligibility scores that are highly correlated to the perceptual intelligi...
Saved in:
Published in: | Computer speech & language 2021-09, Vol.69, p.101213, Article 101213 |
---|---|
Main Authors: | , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •A speaker independent dysarthria speech assessment system for automatic speech intelligibility scoring system.•Assessment technique exploits the raw output of an end-to-end speech to alphabet recognition engine.•Predict’s intelligibility scores that are highly correlated to the perceptual intelligibility scores understood by human.•Automatic selection of optimal number of words, using a cost minimization approach, for intelligibility assessment.•Patient friendly; a small set of words need to be uttered by the patient unlike what is available in the literature.•Visual speech to determine the effort required to utter a word; used in the cost function to find optimal set of words.
Dysarthria is a condition which hampers the ability of an individual to control the muscles that play a major role in speech delivery. The loss of fine control over muscles that assist the movement of lips, vocal chords, tongue and diaphragm results in abnormal speech delivery. One can assess the severity level of dysarthria by analyzing the intelligibility of speech spoken by an individual. Continuous intelligibility assessment helps speech language pathologists not only study the impact of medication but also allows them to plan personalized therapy. It helps the clinicians immensely if the intelligibility assessment system is reliable, automatic, simple for (a) patients to undergo and (b) clinicians to interpret. Lack of availability of dysarthric data has resulted in development of speaker dependentautomatic intelligibility assessment systems which requires patients to speak a large number of utterances. In this paper, we propose (a) a cost minimization procedure to select an optimal (small) number of utterances that need to be spoken by the dysarthric patient, (b) four different speaker independent intelligibility assessment systems which require the patient to speak a small number of words, and (c) the assessment score is close to the perceptual score that the Speech Language Pathologist (SLP) can relate to. The need for small number of utterances to be spoken by the patient and the score being relatable to the SLP benefits both the dysarthric patient and the clinician from usability perspective. |
---|---|
ISSN: | 0885-2308 1095-8363 |
DOI: | 10.1016/j.csl.2021.101213 |