Loading…

Artificial Neural Network Based Amharic Language Speaker Recognition

In this artificial intelligence time, speaker recognition is the most useful biometric recognition technique. Security is a big issue that needs careful attention because of every activities have been becoming automated and internet based. For security purpose, unique features of authorized user are...

Full description

Saved in:
Bibliographic Details
Published in:Turkish journal of computer and mathematics education 2021-04, Vol.12 (3), p.5105-5116
Main Authors: Gebre, Gizachew Belayneh, Urgessa, Teklu, Gopikrishna, T
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 5116
container_issue 3
container_start_page 5105
container_title Turkish journal of computer and mathematics education
container_volume 12
creator Gebre, Gizachew Belayneh
Urgessa, Teklu
Gopikrishna, T
description In this artificial intelligence time, speaker recognition is the most useful biometric recognition technique. Security is a big issue that needs careful attention because of every activities have been becoming automated and internet based. For security purpose, unique features of authorized user are highly needed. Voice is one of the wonderful unique biometric features. So, developing speaker recognition based on scientific research is the most concerned issue. Nowadays, criminal activities are increasing day to day in different clever way. So, every country should have strengthen forensic investigation using such technologies. The study was done by inspiration of contextualizing this concept for our country. In this study, text-independent Amharic language speaker recognition model was developed using Mel-Frequency Cepstral Coefficients to extract features from preprocessed speech signals and Artificial Neural Network to model the feature vector obtained from the Mel-Frequency Cepstral Coefficients and to classify objects while testing. The researcher used 20 sampled speeches of 10 each speaker (total of 200 speech samples) for training and testing separately. By setting the number of hidden neurons to 15, 20, and 25, three different models have been developed and evaluated for accuracy. The fourth-generation high-level programming language and interactive environment MATLAB is used to conduct the overall study implementations. At the end, very promising findings have been obtained. The study achieved better performance than other related researches which used Vector Quantization and Gaussian Mixture Model modelling techniques. Implementable result could obtain for the future by increasing number of speakers and speech samples and including the four Amharic accents.
doi_str_mv 10.17762/turcomat.v12i3.2043
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2625344121</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2625344121</sourcerecordid><originalsourceid>FETCH-LOGICAL-c143t-33ca971a7e8f7f2b49ae43cd5bcd48d1172405d1cb9827ca343c440e87974f103</originalsourceid><addsrcrecordid>eNpNkFtLAzEUhIMoWLT_wIcFn7fmJGc3u4-1XqEoeHkOp9lsTS-bmmQV_721VfBpBmaYgY-xM-AjUKoUF6kPxq8pjT5AODkSHOUBG4DkdY5lIQ__-WM2jHHBOYdCYVWVA3Y1Dsm1zjhaZQ-2DztJnz4ss0uKtsnG6zcKzmRT6uY9zW32vLG0tCF7ssbPO5ec707ZUUuraIe_esJeb65fJnf59PH2fjKe5gZQplxKQ7UCUrZqVStmWJNFaZpiZhqsGgAlkBcNmFldCWVIbkNEbitVK2yByxN2vt_dBP_e25j0wveh215qUYpCIoKAbQv3LRN8jMG2ehPcmsKXBq53yPQfMr1Dpn-QyW_uZmFm</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2625344121</pqid></control><display><type>article</type><title>Artificial Neural Network Based Amharic Language Speaker Recognition</title><source>Education Collection (Proquest) (PQ_SDU_P3)</source><source>Publicly Available Content Database</source><source>Social Science Premium Collection</source><creator>Gebre, Gizachew Belayneh ; Urgessa, Teklu ; Gopikrishna, T</creator><creatorcontrib>Gebre, Gizachew Belayneh ; Urgessa, Teklu ; Gopikrishna, T</creatorcontrib><description>In this artificial intelligence time, speaker recognition is the most useful biometric recognition technique. Security is a big issue that needs careful attention because of every activities have been becoming automated and internet based. For security purpose, unique features of authorized user are highly needed. Voice is one of the wonderful unique biometric features. So, developing speaker recognition based on scientific research is the most concerned issue. Nowadays, criminal activities are increasing day to day in different clever way. So, every country should have strengthen forensic investigation using such technologies. The study was done by inspiration of contextualizing this concept for our country. In this study, text-independent Amharic language speaker recognition model was developed using Mel-Frequency Cepstral Coefficients to extract features from preprocessed speech signals and Artificial Neural Network to model the feature vector obtained from the Mel-Frequency Cepstral Coefficients and to classify objects while testing. The researcher used 20 sampled speeches of 10 each speaker (total of 200 speech samples) for training and testing separately. By setting the number of hidden neurons to 15, 20, and 25, three different models have been developed and evaluated for accuracy. The fourth-generation high-level programming language and interactive environment MATLAB is used to conduct the overall study implementations. At the end, very promising findings have been obtained. The study achieved better performance than other related researches which used Vector Quantization and Gaussian Mixture Model modelling techniques. Implementable result could obtain for the future by increasing number of speakers and speech samples and including the four Amharic accents.</description><identifier>ISSN: 1309-4653</identifier><identifier>EISSN: 1309-4653</identifier><identifier>DOI: 10.17762/turcomat.v12i3.2043</identifier><language>eng</language><publisher>Gurgaon: Ninety Nine Publication</publisher><subject>Artificial intelligence ; Artificial neural networks ; Biometrics ; Crime ; Feature extraction ; Feature recognition ; High level languages ; Neural networks ; Probabilistic models ; Programming languages ; Scientific Research ; Security ; Semitic Languages ; Speech ; Speech recognition ; Vector quantization</subject><ispartof>Turkish journal of computer and mathematics education, 2021-04, Vol.12 (3), p.5105-5116</ispartof><rights>2021. This work is published under https://creativecommons.org/licenses/by/4.0 (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2625344121?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,21358,21374,25732,27903,27904,33590,33856,36991,43712,43859,44569</link.rule.ids></links><search><creatorcontrib>Gebre, Gizachew Belayneh</creatorcontrib><creatorcontrib>Urgessa, Teklu</creatorcontrib><creatorcontrib>Gopikrishna, T</creatorcontrib><title>Artificial Neural Network Based Amharic Language Speaker Recognition</title><title>Turkish journal of computer and mathematics education</title><description>In this artificial intelligence time, speaker recognition is the most useful biometric recognition technique. Security is a big issue that needs careful attention because of every activities have been becoming automated and internet based. For security purpose, unique features of authorized user are highly needed. Voice is one of the wonderful unique biometric features. So, developing speaker recognition based on scientific research is the most concerned issue. Nowadays, criminal activities are increasing day to day in different clever way. So, every country should have strengthen forensic investigation using such technologies. The study was done by inspiration of contextualizing this concept for our country. In this study, text-independent Amharic language speaker recognition model was developed using Mel-Frequency Cepstral Coefficients to extract features from preprocessed speech signals and Artificial Neural Network to model the feature vector obtained from the Mel-Frequency Cepstral Coefficients and to classify objects while testing. The researcher used 20 sampled speeches of 10 each speaker (total of 200 speech samples) for training and testing separately. By setting the number of hidden neurons to 15, 20, and 25, three different models have been developed and evaluated for accuracy. The fourth-generation high-level programming language and interactive environment MATLAB is used to conduct the overall study implementations. At the end, very promising findings have been obtained. The study achieved better performance than other related researches which used Vector Quantization and Gaussian Mixture Model modelling techniques. Implementable result could obtain for the future by increasing number of speakers and speech samples and including the four Amharic accents.</description><subject>Artificial intelligence</subject><subject>Artificial neural networks</subject><subject>Biometrics</subject><subject>Crime</subject><subject>Feature extraction</subject><subject>Feature recognition</subject><subject>High level languages</subject><subject>Neural networks</subject><subject>Probabilistic models</subject><subject>Programming languages</subject><subject>Scientific Research</subject><subject>Security</subject><subject>Semitic Languages</subject><subject>Speech</subject><subject>Speech recognition</subject><subject>Vector quantization</subject><issn>1309-4653</issn><issn>1309-4653</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ALSLI</sourceid><sourceid>CJNVE</sourceid><sourceid>M0P</sourceid><sourceid>PIMPY</sourceid><recordid>eNpNkFtLAzEUhIMoWLT_wIcFn7fmJGc3u4-1XqEoeHkOp9lsTS-bmmQV_721VfBpBmaYgY-xM-AjUKoUF6kPxq8pjT5AODkSHOUBG4DkdY5lIQ__-WM2jHHBOYdCYVWVA3Y1Dsm1zjhaZQ-2DztJnz4ss0uKtsnG6zcKzmRT6uY9zW32vLG0tCF7ssbPO5ec707ZUUuraIe_esJeb65fJnf59PH2fjKe5gZQplxKQ7UCUrZqVStmWJNFaZpiZhqsGgAlkBcNmFldCWVIbkNEbitVK2yByxN2vt_dBP_e25j0wveh215qUYpCIoKAbQv3LRN8jMG2ehPcmsKXBq53yPQfMr1Dpn-QyW_uZmFm</recordid><startdate>20210411</startdate><enddate>20210411</enddate><creator>Gebre, Gizachew Belayneh</creator><creator>Urgessa, Teklu</creator><creator>Gopikrishna, T</creator><general>Ninety Nine Publication</general><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>3V.</scope><scope>7XB</scope><scope>88B</scope><scope>88I</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>CJNVE</scope><scope>DWQXO</scope><scope>EDSIH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M0N</scope><scope>M0P</scope><scope>M2P</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEDU</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20210411</creationdate><title>Artificial Neural Network Based Amharic Language Speaker Recognition</title><author>Gebre, Gizachew Belayneh ; Urgessa, Teklu ; Gopikrishna, T</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c143t-33ca971a7e8f7f2b49ae43cd5bcd48d1172405d1cb9827ca343c440e87974f103</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Artificial intelligence</topic><topic>Artificial neural networks</topic><topic>Biometrics</topic><topic>Crime</topic><topic>Feature extraction</topic><topic>Feature recognition</topic><topic>High level languages</topic><topic>Neural networks</topic><topic>Probabilistic models</topic><topic>Programming languages</topic><topic>Scientific Research</topic><topic>Security</topic><topic>Semitic Languages</topic><topic>Speech</topic><topic>Speech recognition</topic><topic>Vector quantization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gebre, Gizachew Belayneh</creatorcontrib><creatorcontrib>Urgessa, Teklu</creatorcontrib><creatorcontrib>Gopikrishna, T</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection【Remote access available】</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Education Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies &amp; Aerospace Database‎ (1962 - current)</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>Education Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Central</collection><collection>Turkey Database</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Computing Database</collection><collection>ProQuest Education Journals</collection><collection>ProQuest Science Journals</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Education</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Turkish journal of computer and mathematics education</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gebre, Gizachew Belayneh</au><au>Urgessa, Teklu</au><au>Gopikrishna, T</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Artificial Neural Network Based Amharic Language Speaker Recognition</atitle><jtitle>Turkish journal of computer and mathematics education</jtitle><date>2021-04-11</date><risdate>2021</risdate><volume>12</volume><issue>3</issue><spage>5105</spage><epage>5116</epage><pages>5105-5116</pages><issn>1309-4653</issn><eissn>1309-4653</eissn><abstract>In this artificial intelligence time, speaker recognition is the most useful biometric recognition technique. Security is a big issue that needs careful attention because of every activities have been becoming automated and internet based. For security purpose, unique features of authorized user are highly needed. Voice is one of the wonderful unique biometric features. So, developing speaker recognition based on scientific research is the most concerned issue. Nowadays, criminal activities are increasing day to day in different clever way. So, every country should have strengthen forensic investigation using such technologies. The study was done by inspiration of contextualizing this concept for our country. In this study, text-independent Amharic language speaker recognition model was developed using Mel-Frequency Cepstral Coefficients to extract features from preprocessed speech signals and Artificial Neural Network to model the feature vector obtained from the Mel-Frequency Cepstral Coefficients and to classify objects while testing. The researcher used 20 sampled speeches of 10 each speaker (total of 200 speech samples) for training and testing separately. By setting the number of hidden neurons to 15, 20, and 25, three different models have been developed and evaluated for accuracy. The fourth-generation high-level programming language and interactive environment MATLAB is used to conduct the overall study implementations. At the end, very promising findings have been obtained. The study achieved better performance than other related researches which used Vector Quantization and Gaussian Mixture Model modelling techniques. Implementable result could obtain for the future by increasing number of speakers and speech samples and including the four Amharic accents.</abstract><cop>Gurgaon</cop><pub>Ninety Nine Publication</pub><doi>10.17762/turcomat.v12i3.2043</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1309-4653
ispartof Turkish journal of computer and mathematics education, 2021-04, Vol.12 (3), p.5105-5116
issn 1309-4653
1309-4653
language eng
recordid cdi_proquest_journals_2625344121
source Education Collection (Proquest) (PQ_SDU_P3); Publicly Available Content Database; Social Science Premium Collection
subjects Artificial intelligence
Artificial neural networks
Biometrics
Crime
Feature extraction
Feature recognition
High level languages
Neural networks
Probabilistic models
Programming languages
Scientific Research
Security
Semitic Languages
Speech
Speech recognition
Vector quantization
title Artificial Neural Network Based Amharic Language Speaker Recognition
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T17%3A04%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Artificial%20Neural%20Network%20Based%20Amharic%20Language%20Speaker%20Recognition&rft.jtitle=Turkish%20journal%20of%20computer%20and%20mathematics%20education&rft.au=Gebre,%20Gizachew%20Belayneh&rft.date=2021-04-11&rft.volume=12&rft.issue=3&rft.spage=5105&rft.epage=5116&rft.pages=5105-5116&rft.issn=1309-4653&rft.eissn=1309-4653&rft_id=info:doi/10.17762/turcomat.v12i3.2043&rft_dat=%3Cproquest_cross%3E2625344121%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c143t-33ca971a7e8f7f2b49ae43cd5bcd48d1172405d1cb9827ca343c440e87974f103%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2625344121&rft_id=info:pmid/&rfr_iscdi=true