Loading…

Trends in speech emotion recognition: a comprehensive survey

Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite...

Full description

Saved in:
Bibliographic Details
Published in:Multimedia tools and applications 2023-08, Vol.82 (19), p.29307-29351
Main Authors: Kaur, Kamaldeep, Singh, Parminder
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613
cites cdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613
container_end_page 29351
container_issue 19
container_start_page 29307
container_title Multimedia tools and applications
container_volume 82
creator Kaur, Kamaldeep
Singh, Parminder
description Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers.
doi_str_mv 10.1007/s11042-023-14656-y
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2840671229</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2840671229</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</originalsourceid><addsrcrecordid>eNp9kEtLAzEUhYMoWKt_wFXAdTQ3zxlxI8UXFNzUdZhm7rRTbDImbWH-vVNHcOfqnsX5zoWPkGvgt8C5vcsAXAnGhWSgjDasPyET0FYyawWcDlkWnFnN4Zxc5LzhHIwWakIeFglDnWkbaO4Q_ZriNu7aGGhCH1ehPeZ7WlEft13CNYbcHpDmfTpgf0nOmuoz49XvnZKP56fF7JXN31_eZo9z5iWUO6aLpS8U1Nr4WnilGoCq9gXUQpRaFqAFcOOVlqIW1pqywiWWQlVlAbypDMgpuRl3uxS_9ph3bhP3KQwvnSgUNxaGpaElxpZPMeeEjetSu61S74C7oyU3WnKDJfdjyfUDJEcoD-WwwvQ3_Q_1DftgacQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2840671229</pqid></control><display><type>article</type><title>Trends in speech emotion recognition: a comprehensive survey</title><source>ABI/INFORM Global</source><source>Springer Link</source><creator>Kaur, Kamaldeep ; Singh, Parminder</creator><creatorcontrib>Kaur, Kamaldeep ; Singh, Parminder</creatorcontrib><description>Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-023-14656-y</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Classification ; Communication ; Computer Communication Networks ; Computer Science ; Data Structures and Information Theory ; Emotion recognition ; Emotions ; Feature extraction ; Human communication ; Multimedia ; Multimedia Information Systems ; Nervous system ; Special Purpose and Application-Based Systems ; Speech ; Speech recognition ; Trends</subject><ispartof>Multimedia tools and applications, 2023-08, Vol.82 (19), p.29307-29351</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</citedby><cites>FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</cites><orcidid>0000-0002-3542-1214</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2840671229/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2840671229?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11688,27924,27925,36060,44363,74895</link.rule.ids></links><search><creatorcontrib>Kaur, Kamaldeep</creatorcontrib><creatorcontrib>Singh, Parminder</creatorcontrib><title>Trends in speech emotion recognition: a comprehensive survey</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers.</description><subject>Classification</subject><subject>Communication</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Emotion recognition</subject><subject>Emotions</subject><subject>Feature extraction</subject><subject>Human communication</subject><subject>Multimedia</subject><subject>Multimedia Information Systems</subject><subject>Nervous system</subject><subject>Special Purpose and Application-Based Systems</subject><subject>Speech</subject><subject>Speech recognition</subject><subject>Trends</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp9kEtLAzEUhYMoWKt_wFXAdTQ3zxlxI8UXFNzUdZhm7rRTbDImbWH-vVNHcOfqnsX5zoWPkGvgt8C5vcsAXAnGhWSgjDasPyET0FYyawWcDlkWnFnN4Zxc5LzhHIwWakIeFglDnWkbaO4Q_ZriNu7aGGhCH1ehPeZ7WlEft13CNYbcHpDmfTpgf0nOmuoz49XvnZKP56fF7JXN31_eZo9z5iWUO6aLpS8U1Nr4WnilGoCq9gXUQpRaFqAFcOOVlqIW1pqywiWWQlVlAbypDMgpuRl3uxS_9ph3bhP3KQwvnSgUNxaGpaElxpZPMeeEjetSu61S74C7oyU3WnKDJfdjyfUDJEcoD-WwwvQ3_Q_1DftgacQ</recordid><startdate>20230801</startdate><enddate>20230801</enddate><creator>Kaur, Kamaldeep</creator><creator>Singh, Parminder</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0002-3542-1214</orcidid></search><sort><creationdate>20230801</creationdate><title>Trends in speech emotion recognition: a comprehensive survey</title><author>Kaur, Kamaldeep ; Singh, Parminder</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Classification</topic><topic>Communication</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Emotion recognition</topic><topic>Emotions</topic><topic>Feature extraction</topic><topic>Human communication</topic><topic>Multimedia</topic><topic>Multimedia Information Systems</topic><topic>Nervous system</topic><topic>Special Purpose and Application-Based Systems</topic><topic>Speech</topic><topic>Speech recognition</topic><topic>Trends</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kaur, Kamaldeep</creatorcontrib><creatorcontrib>Singh, Parminder</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI商业信息数据库</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer science database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest advanced technologies &amp; aerospace journals</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kaur, Kamaldeep</au><au>Singh, Parminder</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Trends in speech emotion recognition: a comprehensive survey</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2023-08-01</date><risdate>2023</risdate><volume>82</volume><issue>19</issue><spage>29307</spage><epage>29351</epage><pages>29307-29351</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-023-14656-y</doi><tpages>45</tpages><orcidid>https://orcid.org/0000-0002-3542-1214</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1380-7501
ispartof Multimedia tools and applications, 2023-08, Vol.82 (19), p.29307-29351
issn 1380-7501
1573-7721
language eng
recordid cdi_proquest_journals_2840671229
source ABI/INFORM Global; Springer Link
subjects Classification
Communication
Computer Communication Networks
Computer Science
Data Structures and Information Theory
Emotion recognition
Emotions
Feature extraction
Human communication
Multimedia
Multimedia Information Systems
Nervous system
Special Purpose and Application-Based Systems
Speech
Speech recognition
Trends
title Trends in speech emotion recognition: a comprehensive survey
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T12%3A49%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Trends%20in%20speech%20emotion%20recognition:%20a%20comprehensive%20survey&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Kaur,%20Kamaldeep&rft.date=2023-08-01&rft.volume=82&rft.issue=19&rft.spage=29307&rft.epage=29351&rft.pages=29307-29351&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-023-14656-y&rft_dat=%3Cproquest_cross%3E2840671229%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2840671229&rft_id=info:pmid/&rfr_iscdi=true