Loading…

Tunisian Dialectal End-to-end Speech Recognition based on DeepSpeech

Recognize automatically the spontaneous Human speech and transcribe it into text is becoming an important task. However, freely available models are rare especially for under-resourced languages and dialects since they require large amounts of data in order to achieve high performances. This paper d...

Full description

Saved in:
Bibliographic Details
Published in:Procedia computer science 2021, Vol.189, p.183-190
Main Authors: Messaoudi, Abir, Haddad, Hatem, Fourati, Chayma, Hmida, Moez BenHaj, Elhaj Mabrouk, Aymen Ben, Graiet, Mohamed
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Recognize automatically the spontaneous Human speech and transcribe it into text is becoming an important task. However, freely available models are rare especially for under-resourced languages and dialects since they require large amounts of data in order to achieve high performances. This paper describes an approach to build an end-to-end Tunisian dialect speech system based on deep learning. For this propose, a Tunisian dialect paired text-speech dataset called "TunSpeech" was created. Existing Modern Standard Arabic (MSA) speech data was also combined with dialectal Tunisian data and decreased the Out-Of-Vocabulary rate and improve perplexity. On the other hand, synthetic dialectal data from a text to speech increased the Word Error Rate.
ISSN:1877-0509
1877-0509
DOI:10.1016/j.procs.2021.05.082