Loading…

Variable STFT Layered CNN Model for Automated Dysarthria Detection and Severity Assessment Using Raw Speech

This paper presents a novel approach for automated dysarthria detection and severity assessment using a variable short-time Fourier transform layered convolutional neural networks (CNN) model. Dysarthria is a speech disorder characterized by difficulties in articulation, resulting in unclear speech....

Full description

Saved in:

Bibliographic Details
Published in:	Circuits, systems, and signal processing systems, and signal processing, 2024-05, Vol.43 (5), p.3261-3278
Main Authors:	Radha, Kodali, Bansal, Mohan, Dhulipalla, Venkata Rao
Format:	Article
Language:	English
Subjects:	Accuracy Artificial neural networks Automation Circuits and Systems Communication Datasets Deep learning Dysarthria Electrical Engineering Electronics and Microelectronics Engineering Fourier transforms Instrumentation Neural networks Signal processing Signal,Image and Speech Processing Speech Speech disorders
Citations:	Items that this one cites Items that cite this one
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	This paper presents a novel approach for automated dysarthria detection and severity assessment using a variable short-time Fourier transform layered convolutional neural networks (CNN) model. Dysarthria is a speech disorder characterized by difficulties in articulation, resulting in unclear speech. The model is evaluated on two datasets, TORGO and UA-Speech, consisting of individuals with dysarthria and healthy controls. Various variations of the CNN’s first layer, including spectrogram, log spectrogram, and pre-emphasis filtering (PEF) with and without learnables, are investigated. Notably, the PEF with 5 learnables achieves the highest accuracy in detecting dysarthria and assessing its severity. The study highlights the significance of dataset size, with UA-Speech dataset showing superior performance due to its larger size, enabling better capture of dysarthria severity variations. This research contributes to the advancement of objective dysarthria assessment, aiding in early diagnosis and personalized treatment for individuals with speech disorders.
ISSN:	0278-081X 1531-5878
DOI:	10.1007/s00034-024-02611-7