Loading…
Tunisian Dialectal End-to-end Speech Recognition based on DeepSpeech
Recognize automatically the spontaneous Human speech and transcribe it into text is becoming an important task. However, freely available models are rare especially for under-resourced languages and dialects since they require large amounts of data in order to achieve high performances. This paper d...
Saved in:
Published in: | Procedia computer science 2021, Vol.189, p.183-190 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Recognize automatically the spontaneous Human speech and transcribe it into text is becoming an important task. However, freely available models are rare especially for under-resourced languages and dialects since they require large amounts of data in order to achieve high performances. This paper describes an approach to build an end-to-end Tunisian dialect speech system based on deep learning. For this propose, a Tunisian dialect paired text-speech dataset called "TunSpeech" was created. Existing Modern Standard Arabic (MSA) speech data was also combined with dialectal Tunisian data and decreased the Out-Of-Vocabulary rate and improve perplexity. On the other hand, synthetic dialectal data from a text to speech increased the Word Error Rate. |
---|---|
ISSN: | 1877-0509 1877-0509 |
DOI: | 10.1016/j.procs.2021.05.082 |