Loading…
Clustering based emotional speech recognition using fuzzy and K-means for tamil language
Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tami...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | 1 |
container_start_page | |
container_title | |
container_volume | 3028 |
creator | John, Bennilo Fernandes Kusumanchi, T. P. S. Kumar Ramamoorthi, Agilesh Saravanan Ramanathula, Sireesha Kongala, Raju |
description | Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution. |
doi_str_mv | 10.1063/5.0213359 |
format | conference_proceeding |
fullrecord | <record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_3076803782</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3076803782</sourcerecordid><originalsourceid>FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3</originalsourceid><addsrcrecordid>eNotkEtrwzAQhEVpoWnaQ_-BoLeCU8mKJOtYQl800EsLuZmVvHId_KpkH5JfX5vksgPLxzAzhNxztuJMiSe5YikXQpoLsuBS8kQrri7JgjGzTtK12F2Tmxj3jKVG62xBdpt6jAOGqi2phYgFxaYbqq6FmsYe0f3SgK4r22p-0jHOoB-PxwOFtqCfSYPQRuq7QAdoqprW0JYjlHhLrjzUEe_OuiQ_ry_fm_dk-_X2sXneJv0Uc0hMlgJPnUWhi0yhMk5zacH66XgHVgqhlBHaG6a40WgBFIAsnLFaePRiSR5Ovn3o_kaMQ77vxjDFj7lgWmVM6CydqMcTFV01wFwl70PVQDjknOXzcrnMz8uJf6t_YaM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>3076803782</pqid></control><display><type>conference_proceeding</type><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><source>American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list)</source><creator>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju</creator><contributor>Chennakesavulu, M. ; Ramanjaneyulu, N.</contributor><creatorcontrib>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju ; Chennakesavulu, M. ; Ramanjaneyulu, N.</creatorcontrib><description>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0213359</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Algorithms ; Clustering ; Emotion recognition ; Emotions ; Speech recognition</subject><ispartof>AIP conference proceedings, 2024, Vol.3028 (1)</ispartof><rights>Author(s)</rights><rights>2024 Author(s). Published under an exclusive license by AIP Publishing.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>309,310,314,776,780,785,786,23909,23910,25118,27901,27902</link.rule.ids></links><search><contributor>Chennakesavulu, M.</contributor><contributor>Ramanjaneyulu, N.</contributor><creatorcontrib>John, Bennilo Fernandes</creatorcontrib><creatorcontrib>Kusumanchi, T. P. S. Kumar</creatorcontrib><creatorcontrib>Ramamoorthi, Agilesh Saravanan</creatorcontrib><creatorcontrib>Ramanathula, Sireesha</creatorcontrib><creatorcontrib>Kongala, Raju</creatorcontrib><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><title>AIP conference proceedings</title><description>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</description><subject>Algorithms</subject><subject>Clustering</subject><subject>Emotion recognition</subject><subject>Emotions</subject><subject>Speech recognition</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2024</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkEtrwzAQhEVpoWnaQ_-BoLeCU8mKJOtYQl800EsLuZmVvHId_KpkH5JfX5vksgPLxzAzhNxztuJMiSe5YikXQpoLsuBS8kQrri7JgjGzTtK12F2Tmxj3jKVG62xBdpt6jAOGqi2phYgFxaYbqq6FmsYe0f3SgK4r22p-0jHOoB-PxwOFtqCfSYPQRuq7QAdoqprW0JYjlHhLrjzUEe_OuiQ_ry_fm_dk-_X2sXneJv0Uc0hMlgJPnUWhi0yhMk5zacH66XgHVgqhlBHaG6a40WgBFIAsnLFaePRiSR5Ovn3o_kaMQ77vxjDFj7lgWmVM6CydqMcTFV01wFwl70PVQDjknOXzcrnMz8uJf6t_YaM</recordid><startdate>20240708</startdate><enddate>20240708</enddate><creator>John, Bennilo Fernandes</creator><creator>Kusumanchi, T. P. S. Kumar</creator><creator>Ramamoorthi, Agilesh Saravanan</creator><creator>Ramanathula, Sireesha</creator><creator>Kongala, Raju</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20240708</creationdate><title>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</title><author>John, Bennilo Fernandes ; Kusumanchi, T. P. S. Kumar ; Ramamoorthi, Agilesh Saravanan ; Ramanathula, Sireesha ; Kongala, Raju</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Clustering</topic><topic>Emotion recognition</topic><topic>Emotions</topic><topic>Speech recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>John, Bennilo Fernandes</creatorcontrib><creatorcontrib>Kusumanchi, T. P. S. Kumar</creatorcontrib><creatorcontrib>Ramamoorthi, Agilesh Saravanan</creatorcontrib><creatorcontrib>Ramanathula, Sireesha</creatorcontrib><creatorcontrib>Kongala, Raju</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>John, Bennilo Fernandes</au><au>Kusumanchi, T. P. S. Kumar</au><au>Ramamoorthi, Agilesh Saravanan</au><au>Ramanathula, Sireesha</au><au>Kongala, Raju</au><au>Chennakesavulu, M.</au><au>Ramanjaneyulu, N.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Clustering based emotional speech recognition using fuzzy and K-means for tamil language</atitle><btitle>AIP conference proceedings</btitle><date>2024-07-08</date><risdate>2024</risdate><volume>3028</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>Communication among people and robots is still a difficult task where the machine should understand and respond to the manner in which it interacts like emotions, so that interaction between humans and computers seems simpler and natural. In this article, we examine a method for classifying the Tamil emotional database through a comparative analysis of three Fuzzy prototype techniques: FCM, KFCM, and k-means algorithm. The goal is to identify the most efficient approach. The extraction of the emotional speech feature is analyzed via MFCC delta & Spectral Skewness concatenation methodology to get more features for the analysis. For training and testing purposes, both male and female speech samples an emotional Tamil language database was developed with PCA & followed by ICA technique in order to utilize the data for higher order dataset. Emotion detection outcomes have both the benefits and drawbacks of their own approach. Our analysis describes the efficiency of the KFCM methodology shows more than k-means and FCM approaches and analysis it’s clear that the emotions like anger, happy and normal shows higher rate of accuracy than other emotions with overall approximation of 84% of precision and also the time taken for the execution by KFCM is 0.238 mins, where the other methods take slightly higher time for execution.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0213359</doi><tpages>11</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0094-243X |
ispartof | AIP conference proceedings, 2024, Vol.3028 (1) |
issn | 0094-243X 1551-7616 |
language | eng |
recordid | cdi_proquest_journals_3076803782 |
source | American Institute of Physics:Jisc Collections:Transitional Journals Agreement 2021-23 (Reading list) |
subjects | Algorithms Clustering Emotion recognition Emotions Speech recognition |
title | Clustering based emotional speech recognition using fuzzy and K-means for tamil language |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-23T17%3A28%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Clustering%20based%20emotional%20speech%20recognition%20using%20fuzzy%20and%20K-means%20for%20tamil%20language&rft.btitle=AIP%20conference%20proceedings&rft.au=John,%20Bennilo%20Fernandes&rft.date=2024-07-08&rft.volume=3028&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0213359&rft_dat=%3Cproquest_scita%3E3076803782%3C/proquest_scita%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-p133t-982a12cbe37d86e69c715babf5bafcab53366937f906197ebaa6aa5dc9b73fef3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3076803782&rft_id=info:pmid/&rfr_iscdi=true |