
Clustering based emotional speech recognition using fuzzy and K-means for tamil language

Communication between people and robots remains a difficult task: the machine should understand and respond to the manner of interaction, such as emotions, so that interaction between humans and computers seems simpler and more natural. In this article, we examine a method for classifying the Tami...

Bibliographic Details
Main Authors: John, Bennilo Fernandes; Kusumanchi, T. P. S. Kumar; Ramamoorthi, Agilesh Saravanan; Ramanathula, Sireesha; Kongala, Raju
Format: Conference Proceeding
Language: English
Subjects: Algorithms; Clustering; Emotion recognition; Emotions; Speech recognition
cited_by
cites
container_end_page
container_issue 1
container_start_page
container_title
container_volume 3028
creator John, Bennilo Fernandes
Kusumanchi, T. P. S. Kumar
Ramamoorthi, Agilesh Saravanan
Ramanathula, Sireesha
Kongala, Raju
description Communication between people and robots remains a difficult task: the machine should understand and respond to the manner of interaction, such as emotions, so that interaction between humans and computers seems simpler and more natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and the k-means algorithm. The goal is to identify the most efficient approach. Emotional speech features are extracted by concatenating MFCC delta and spectral skewness features, which yields more features for the analysis. For training and testing, an emotional Tamil language database of both male and female speech samples was developed, with PCA followed by ICA applied so that the data could be used as a higher-order dataset. Each approach has its own benefits and drawbacks in emotion detection. Our analysis shows that the KFCM methodology is more efficient than the k-means and FCM approaches, and that emotions such as anger, happy, and normal show a higher accuracy rate than other emotions, with an overall precision of approximately 84%. The execution time of KFCM is 0.238 min, whereas the other methods take slightly longer.
doi_str_mv 10.1063/5.0213359
format conference_proceeding
fullrecord <record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_3076803782</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3076803782</sourcerecordid><originalsourceid>FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3</originalsourceid><addsrcrecordid>eNotkEtrwzAQhEVpoWnaQ_-BoLeCU8mKJOtYQl800EsLuZmVvHId_KpkH5JfX5vksgPLxzAzhNxztuJMiSe5YikXQpoLsuBS8kQrri7JgjGzTtK12F2Tmxj3jKVG62xBdpt6jAOGqi2phYgFxaYbqq6FmsYe0f3SgK4r22p-0jHOoB-PxwOFtqCfSYPQRuq7QAdoqprW0JYjlHhLrjzUEe_OuiQ_ry_fm_dk-_X2sXneJv0Uc0hMlgJPnUWhi0yhMk5zacH66XgHVgqhlBHaG6a40WgBFIAsnLFaePRiSR5Ovn3o_kaMQ77vxjDFj7lgWmVM6CydqMcTFV01wFwl70PVQDjknOXzcrnMz8uJf6t_YaM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>3076803782</pqid></control><display><type>conference_proceeding</type><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><creator>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju</creator><contributor>Chennakesavulu, M. ; Ramanjaneyulu, N.</contributor><creatorcontrib>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju ; Chennakesavulu, M. ; Ramanjaneyulu, N.</creatorcontrib><description>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. 
The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta &amp; Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA &amp; followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0213359</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Algorithms ; Clustering ; Emotion recognition ; Emotions ; Speech recognition</subject><ispartof>AIP conference proceedings, 2024, Vol.3028 (1)</ispartof><rights>Author(s)</rights><rights>2024 Author(s). Published under an exclusive license by AIP Publishing.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>309,310,314,776,780,785,786,23909,23910,25118,27901,27902</link.rule.ids></links><search><contributor>Chennakesavulu, M.</contributor><contributor>Ramanjaneyulu, N.</contributor><creatorcontrib>John, Bennilo Fernandes</creatorcontrib><creatorcontrib>Kusumanchi, T. P. 
S. Kumar</creatorcontrib><creatorcontrib>Ramamoorthi, Agilesh Saravanan</creatorcontrib><creatorcontrib>Ramanathula, Sireesha</creatorcontrib><creatorcontrib>Kongala, Raju</creatorcontrib><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><title>AIP conference proceedings</title><description>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta &amp; Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA &amp; followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. 
Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Emotion recognition</subject><subject>Emotions</subject><subject>Speech recognition</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkEtrwzAQhEVpoWnaQ_-BoLeCU8mKJOtYQl800EsLuZmVvHId_KpkH5JfX5vksgPLxzAzhNxztuJMiSe5YikXQpoLsuBS8kQrri7JgjGzTtK12F2Tmxj3jKVG62xBdpt6jAOGqi2phYgFxaYbqq6FmsYe0f3SgK4r22p-0jHOoB-PxwOFtqCfSYPQRuq7QAdoqprW0JYjlHhLrjzUEe_OuiQ_ry_fm_dk-_X2sXneJv0Uc0hMlgJPnUWhi0yhMk5zacH66XgHVgqhlBHaG6a40WgBFIAsnLFaePRiSR5Ovn3o_kaMQ77vxjDFj7lgWmVM6CydqMcTFV01wFwl70PVQDjknOXzcrnMz8uJf6t_YaM</recordid><startdate>20240708</startdate><enddate>20240708</enddate><creator>John, Bennilo Fernandes</creator><creator>Kusumanchi, T. P. S. Kumar</creator><creator>Ramamoorthi, Agilesh Saravanan</creator><creator>Ramanathula, Sireesha</creator><creator>Kongala, Raju</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20240708</creationdate><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><author>John, Bennilo Fernandes ; Kusumanchi, T. P. S. 
Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Emotion recognition</topic><topic>Emotions</topic><topic>Speech recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>John, Bennilo Fernandes</creatorcontrib><creatorcontrib>Kusumanchi, T. P. S. Kumar</creatorcontrib><creatorcontrib>Ramamoorthi, Agilesh Saravanan</creatorcontrib><creatorcontrib>Ramanathula, Sireesha</creatorcontrib><creatorcontrib>Kongala, Raju</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>John, Bennilo Fernandes</au><au>Kusumanchi, T. P. S. Kumar</au><au>Ramamoorthi, Agilesh Saravanan</au><au>Ramanathula, Sireesha</au><au>Kongala, Raju</au><au>Chennakesavulu, M.</au><au>Ramanjaneyulu, N.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</atitle><btitle>AIP conference proceedings</btitle><date>2024-07-08</date><risdate>2024</risdate><volume>3028</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. 
In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta &amp; Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA &amp; followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0213359</doi><tpages>11</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0094-243X
ispartof AIP conference proceedings, 2024, Vol.3028 (1)
issn 0094-243X
1551-7616
language eng
recordid cdi_proquest_journals_3076803782
source American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)
subjects Algorithms
Clustering
Emotion recognition
Emotions
Speech recognition
title Clustering based emotional speech recognition using fuzzy and K-means for tamil language
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-23T17%3A28%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Clustering%20based%20emotional%20speech%20recognition%20using%20fuzzy%20and%20K-means%20for%20tamil%20language&rft.btitle=AIP%20conference%20proceedings&rft.au=John,%20Bennilo%20Fernandes&rft.date=2024-07-08&rft.volume=3028&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0213359&rft_dat=%3Cproquest_scita%3E3076803782%3C/proquest_scita%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3076803782&rft_id=info:pmid/&rfr_iscdi=true
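
The description field in this record compares FCM, KFCM, and k-means clustering. For readers unfamiliar with the first of these, the following is a minimal sketch of standard fuzzy c-means, assuming plain Euclidean distance and a fuzzifier of m = 2. It is not the authors' implementation: the function name `fuzzy_c_means` and all of its parameters are hypothetical, the kernel variant (KFCM) is not shown, and the MFCC, spectral skewness, PCA, and ICA steps mentioned in the abstract are left out.

```python
import math
import random


def fuzzy_c_means(points, n_clusters, m=2.0, n_iter=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means on a list of equal-length float vectors.

    Returns (centers, memberships). memberships[i][k] is the degree to
    which point i belongs to cluster k; every row sums to 1.
    All names and defaults here are illustrative assumptions.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])

    # Random initial memberships, normalised so each row sums to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(n_clusters)]
        total = sum(row)
        u.append([v / total for v in row])

    centers = []
    for _ in range(n_iter):
        # Centre update: mean of the points weighted by membership ** m.
        centers = []
        for k in range(n_clusters):
            w = [u[i][k] ** m for i in range(n)]
            total = sum(w)
            centers.append(
                [sum(w[i] * points[i][d] for i in range(n)) / total
                 for d in range(dim)]
            )

        # Membership update from relative distances to every centre.
        shift = 0.0
        for i in range(n):
            # Clamp distances away from zero to avoid division by zero.
            dist = [max(math.dist(points[i], c), 1e-12) for c in centers]
            new_row = [
                1.0 / sum((dist[k] / dist[j]) ** (2.0 / (m - 1.0))
                          for j in range(n_clusters))
                for k in range(n_clusters)
            ]
            shift = max(shift, max(abs(a - b) for a, b in zip(new_row, u[i])))
            u[i] = new_row

        # Stop once no membership changed by more than the tolerance.
        if shift < tol:
            break

    return centers, u
```

Unlike k-means, which assigns each sample to exactly one cluster, this returns a soft membership row per sample; taking the largest entry of each row gives a hard label when one is needed. In a setting like the one the abstract describes, `points` would hold one feature vector per utterance, which is an assumption about usage rather than something the record states.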