Loading…
Tabby Talks: An automated tool for the assessment of childhood apraxia of speech
•Automated tool to assess productions from children with apraxia of speech.•Consists of clinician interface, mobile application and speech processing engine.•Automatically detects groping errors, articulation errors and prosodic errors.•Lattice-based Pronunciation Verification module detects articul...
Saved in:
Published in: | Speech communication 2015-06, Vol.70, p.49-64 |
---|---|
Main Authors: | , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •Automated tool to assess productions from children with apraxia of speech.•Consists of clinician interface, mobile application and speech processing engine.•Automatically detects groping errors, articulation errors and prosodic errors.•Lattice-based Pronunciation Verification module detects articulation errors.•Lexical Stress Pattern Verification module detects prosodic errors.
Children with developmental disabilities such as childhood apraxia of speech (CAS) require repeated intervention sessions with a speech therapist, sometimes extending over several years. Technology-based therapy tools offer the potential to reduce the demanding workload of speech therapists as well as time and cost for families. In response to this need, we have developed “Tabby Talks,” a multi-tier system for remote administration of speech therapy. This paper describes the speech processing pipeline to automatically detect common errors associated with CAS. The pipeline contains modules for voice activity detection, pronunciation verification, and lexical stress verification. The voice activity detector evaluates the intensity contour of an utterance and compares it against an adaptive threshold to detect silence segments and measure voicing delays and total production time. The pronunciation verification module uses a generic search lattice structure with multiple internal paths that covers all possible pronunciation errors (substitutions, insertions and deletions) in the child’s production. Finally, the lexical stress verification module classifies the lexical stress across consecutive syllables into strong–weak or weak-strong patterns using a combination of prosodic and spectral measures. These error measures can be provided to the therapist through a web interface, to enable them to adapt the child’s therapy program remotely. When evaluated on a dataset of typically developing and disordered speech from children ages 4–16years, the system achieves a pronunciation verification accuracy of 88.2% at the phoneme level and 80.7% at the utterance level, and lexical stress classification rate of 83.3%. |
---|---|
ISSN: | 0167-6393 1872-7182 |
DOI: | 10.1016/j.specom.2015.04.002 |