Loading…

Tabby Talks: An automated tool for the assessment of childhood apraxia of speech

•Automated tool to assess productions from children with apraxia of speech.•Consists of clinician interface, mobile application and speech processing engine.•Automatically detects groping errors, articulation errors and prosodic errors.•Lattice-based Pronunciation Verification module detects articul...

Full description

Saved in:
Bibliographic Details
Published in:Speech communication 2015-06, Vol.70, p.49-64
Main Authors: Shahin, Mostafa, Ahmed, Beena, Parnandi, Avinash, Karappa, Virendra, McKechnie, Jacqueline, Ballard, Kirrie J., Gutierrez-Osuna, Ricardo
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Automated tool to assess productions from children with apraxia of speech.•Consists of clinician interface, mobile application and speech processing engine.•Automatically detects groping errors, articulation errors and prosodic errors.•Lattice-based Pronunciation Verification module detects articulation errors.•Lexical Stress Pattern Verification module detects prosodic errors. Children with developmental disabilities such as childhood apraxia of speech (CAS) require repeated intervention sessions with a speech therapist, sometimes extending over several years. Technology-based therapy tools offer the potential to reduce the demanding workload of speech therapists as well as time and cost for families. In response to this need, we have developed “Tabby Talks,” a multi-tier system for remote administration of speech therapy. This paper describes the speech processing pipeline to automatically detect common errors associated with CAS. The pipeline contains modules for voice activity detection, pronunciation verification, and lexical stress verification. The voice activity detector evaluates the intensity contour of an utterance and compares it against an adaptive threshold to detect silence segments and measure voicing delays and total production time. The pronunciation verification module uses a generic search lattice structure with multiple internal paths that covers all possible pronunciation errors (substitutions, insertions and deletions) in the child’s production. Finally, the lexical stress verification module classifies the lexical stress across consecutive syllables into strong–weak or weak-strong patterns using a combination of prosodic and spectral measures. These error measures can be provided to the therapist through a web interface, to enable them to adapt the child’s therapy program remotely. When evaluated on a dataset of typically developing and disordered speech from children ages 4–16years, the system achieves a pronunciation verification accuracy of 88.2% at the phoneme level and 80.7% at the utterance level, and lexical stress classification rate of 83.3%.
ISSN:0167-6393
1872-7182
DOI:10.1016/j.specom.2015.04.002