Loading…

Clustering based emotional speech recognition using fuzzy and K-means for tamil language

Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tami...

Full description

Saved in:

Bibliographic Details
Main Authors:	John, Bennilo Fernandes, Kusumanchi, T. P. S. Kumar, Ramamoorthi, Agilesh Saravanan, Ramanathula, Sireesha, Kongala, Raju
Format:	Conference Proceeding
Language:	English
Subjects:	Algorithms Clustering Emotion recognition Emotions Speech recognition
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page
container_issue	1
container_start_page
container_title
container_volume	3028
creator	John, Bennilo Fernandes Kusumanchi, T. P. S. Kumar Ramamoorthi, Agilesh Saravanan Ramanathula, Sireesha Kongala, Raju
description	Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.
doi_str_mv	10.1063/5.0213359
format	conference_proceeding
fullrecord	<record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_3076803782</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3076803782</sourcerecordid><originalsourceid>FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3</originalsourceid><addsrcrecordid>eNotkEtrwzAQhEVpoWnaQ_-BoLeCU8mKJOtYQl800EsLuZmVvHId_KpkH5JfX5vksgPLxzAzhNxztuJMiSe5YikXQpoLsuBS8kQrri7JgjGzTtK12F2Tmxj3jKVG62xBdpt6jAOGqi2phYgFxaYbqq6FmsYe0f3SgK4r22p-0jHOoB-PxwOFtqCfSYPQRuq7QAdoqprW0JYjlHhLrjzUEe_OuiQ_ry_fm_dk-_X2sXneJv0Uc0hMlgJPnUWhi0yhMk5zacH66XgHVgqhlBHaG6a40WgBFIAsnLFaePRiSR5Ovn3o_kaMQ77vxjDFj7lgWmVM6CydqMcTFV01wFwl70PVQDjknOXzcrnMz8uJf6t_YaM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>3076803782</pqid></control><display><type>conference_proceeding</type><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><creator>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju</creator><contributor>Chennakesavulu, M. ; Ramanjaneyulu, N.</contributor><creatorcontrib>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju ; Chennakesavulu, M. ; Ramanjaneyulu, N.</creatorcontrib><description>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0213359</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Algorithms ; Clustering ; Emotion recognition ; Emotions ; Speech recognition</subject><ispartof>AIP conference proceedings, 2024, Vol.3028 (1)</ispartof><rights>Author(s)</rights><rights>2024 Author(s). Published under an exclusive license by AIP Publishing.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>309,310,314,776,780,785,786,23909,23910,25118,27901,27902</link.rule.ids></links><search><contributor>Chennakesavulu, M.</contributor><contributor>Ramanjaneyulu, N.</contributor><creatorcontrib>John, Bennilo Fernandes</creatorcontrib><creatorcontrib>Kusumanchi, T. P. S. Kumar</creatorcontrib><creatorcontrib>Ramamoorthi, Agilesh Saravanan</creatorcontrib><creatorcontrib>Ramanathula, Sireesha</creatorcontrib><creatorcontrib>Kongala, Raju</creatorcontrib><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><title>AIP conference proceedings</title><description>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Emotion recognition</subject><subject>Emotions</subject><subject>Speech recognition</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkEtrwzAQhEVpoWnaQ_-BoLeCU8mKJOtYQl800EsLuZmVvHId_KpkH5JfX5vksgPLxzAzhNxztuJMiSe5YikXQpoLsuBS8kQrri7JgjGzTtK12F2Tmxj3jKVG62xBdpt6jAOGqi2phYgFxaYbqq6FmsYe0f3SgK4r22p-0jHOoB-PxwOFtqCfSYPQRuq7QAdoqprW0JYjlHhLrjzUEe_OuiQ_ry_fm_dk-_X2sXneJv0Uc0hMlgJPnUWhi0yhMk5zacH66XgHVgqhlBHaG6a40WgBFIAsnLFaePRiSR5Ovn3o_kaMQ77vxjDFj7lgWmVM6CydqMcTFV01wFwl70PVQDjknOXzcrnMz8uJf6t_YaM</recordid><startdate>20240708</startdate><enddate>20240708</enddate><creator>John, Bennilo Fernandes</creator><creator>Kusumanchi, T. P. S. Kumar</creator><creator>Ramamoorthi, Agilesh Saravanan</creator><creator>Ramanathula, Sireesha</creator><creator>Kongala, Raju</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20240708</creationdate><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><author>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Emotion recognition</topic><topic>Emotions</topic><topic>Speech recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>John, Bennilo Fernandes</creatorcontrib><creatorcontrib>Kusumanchi, T. P. S. Kumar</creatorcontrib><creatorcontrib>Ramamoorthi, Agilesh Saravanan</creatorcontrib><creatorcontrib>Ramanathula, Sireesha</creatorcontrib><creatorcontrib>Kongala, Raju</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>John, Bennilo Fernandes</au><au>Kusumanchi, T. P. S. Kumar</au><au>Ramamoorthi, Agilesh Saravanan</au><au>Ramanathula, Sireesha</au><au>Kongala, Raju</au><au>Chennakesavulu, M.</au><au>Ramanjaneyulu, N.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</atitle><btitle>AIP conference proceedings</btitle><date>2024-07-08</date><risdate>2024</risdate><volume>3028</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0213359</doi><tpages>11</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0094-243X
ispartof	AIP conference proceedings, 2024, Vol.3028 (1)
issn	0094-243X 1551-7616
language	eng
recordid	cdi_proquest_journals_3076803782
source	American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)
subjects	Algorithms Clustering Emotion recognition Emotions Speech recognition
title	Clustering based emotional speech recognition using fuzzy and K-means for tamil language
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-23T17%3A28%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Clustering%20based%20emotional%20speech%20recognition%20using%20fuzzy%20and%20K-means%20for%20tamil%20language&rft.btitle=AIP%20conference%20proceedings&rft.au=John,%20Bennilo%20Fernandes&rft.date=2024-07-08&rft.volume=3028&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0213359&rft_dat=%3Cproquest_scita%3E3076803782%3C/proquest_scita%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3076803782&rft_id=info:pmid/&rfr_iscdi=true