Loading…

Identifying Low-Resource Languages in Speech Recordings through Deep Learning

The aim of this paper is to build a system that identifies a low resource language, like the Albanian language, in speech recordings. Our proposed system is based on the conversion of audio signals into spectrograms. We have built 2 models for the identification of spoken language based on spectrogr...

Full description

Saved in:
Bibliographic Details
Main Authors: Binjaku, Kleona, Janku, Joan, Mece, Elinda Kajo
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The aim of this paper is to build a system that identifies a low resource language, like the Albanian language, in speech recordings. Our proposed system is based on the conversion of audio signals into spectrograms. We have built 2 models for the identification of spoken language based on spectrograms images using Artificial Neural Networks (ANN) and Convolutional Neural Networks (CNN). The dataset with spoken audio signals in the Albanian language, we have built manually. The results are taken based on two languages, but the system works if other languages are added. Both models have shown good capabilities to learn Albanian language patterns from spectrograms and the achieved accuracies are 85% (ANN) and 94% (CNN) respectively. We have studied different cases how spectrograms' color and size impact the performance of our models.
ISSN:1847-358X