Loading…
Trends in speech emotion recognition: a comprehensive survey
Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite...
Saved in:
Published in: | Multimedia tools and applications 2023-08, Vol.82 (19), p.29307-29351 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613 |
---|---|
cites | cdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613 |
container_end_page | 29351 |
container_issue | 19 |
container_start_page | 29307 |
container_title | Multimedia tools and applications |
container_volume | 82 |
creator | Kaur, Kamaldeep Singh, Parminder |
description | Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers. |
doi_str_mv | 10.1007/s11042-023-14656-y |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2840671229</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2840671229</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</originalsourceid><addsrcrecordid>eNp9kEtLAzEUhYMoWKt_wFXAdTQ3zxlxI8UXFNzUdZhm7rRTbDImbWH-vVNHcOfqnsX5zoWPkGvgt8C5vcsAXAnGhWSgjDasPyET0FYyawWcDlkWnFnN4Zxc5LzhHIwWakIeFglDnWkbaO4Q_ZriNu7aGGhCH1ehPeZ7WlEft13CNYbcHpDmfTpgf0nOmuoz49XvnZKP56fF7JXN31_eZo9z5iWUO6aLpS8U1Nr4WnilGoCq9gXUQpRaFqAFcOOVlqIW1pqywiWWQlVlAbypDMgpuRl3uxS_9ph3bhP3KQwvnSgUNxaGpaElxpZPMeeEjetSu61S74C7oyU3WnKDJfdjyfUDJEcoD-WwwvQ3_Q_1DftgacQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2840671229</pqid></control><display><type>article</type><title>Trends in speech emotion recognition: a comprehensive survey</title><source>ABI/INFORM Global</source><source>Springer Link</source><creator>Kaur, Kamaldeep ; Singh, Parminder</creator><creatorcontrib>Kaur, Kamaldeep ; Singh, Parminder</creatorcontrib><description>Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-023-14656-y</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Classification ; Communication ; Computer Communication Networks ; Computer Science ; Data Structures and Information Theory ; Emotion recognition ; Emotions ; Feature extraction ; Human communication ; Multimedia ; Multimedia Information Systems ; Nervous system ; Special Purpose and Application-Based Systems ; Speech ; Speech recognition ; Trends</subject><ispartof>Multimedia tools and applications, 2023-08, Vol.82 (19), p.29307-29351</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</citedby><cites>FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</cites><orcidid>0000-0002-3542-1214</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/2840671229/fulltextPDF?pq-origsite=primo$$EPDF$$P50$$Gproquest$$H</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/2840671229?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,11688,27924,27925,36060,44363,74895</link.rule.ids></links><search><creatorcontrib>Kaur, Kamaldeep</creatorcontrib><creatorcontrib>Singh, Parminder</creatorcontrib><title>Trends in speech emotion recognition: a comprehensive survey</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers.</description><subject>Classification</subject><subject>Communication</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Emotion recognition</subject><subject>Emotions</subject><subject>Feature extraction</subject><subject>Human communication</subject><subject>Multimedia</subject><subject>Multimedia Information Systems</subject><subject>Nervous system</subject><subject>Special Purpose and Application-Based Systems</subject><subject>Speech</subject><subject>Speech recognition</subject><subject>Trends</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>M0C</sourceid><recordid>eNp9kEtLAzEUhYMoWKt_wFXAdTQ3zxlxI8UXFNzUdZhm7rRTbDImbWH-vVNHcOfqnsX5zoWPkGvgt8C5vcsAXAnGhWSgjDasPyET0FYyawWcDlkWnFnN4Zxc5LzhHIwWakIeFglDnWkbaO4Q_ZriNu7aGGhCH1ehPeZ7WlEft13CNYbcHpDmfTpgf0nOmuoz49XvnZKP56fF7JXN31_eZo9z5iWUO6aLpS8U1Nr4WnilGoCq9gXUQpRaFqAFcOOVlqIW1pqywiWWQlVlAbypDMgpuRl3uxS_9ph3bhP3KQwvnSgUNxaGpaElxpZPMeeEjetSu61S74C7oyU3WnKDJfdjyfUDJEcoD-WwwvQ3_Q_1DftgacQ</recordid><startdate>20230801</startdate><enddate>20230801</enddate><creator>Kaur, Kamaldeep</creator><creator>Singh, Parminder</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0002-3542-1214</orcidid></search><sort><creationdate>20230801</creationdate><title>Trends in speech emotion recognition: a comprehensive survey</title><author>Kaur, Kamaldeep ; Singh, Parminder</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Classification</topic><topic>Communication</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Emotion recognition</topic><topic>Emotions</topic><topic>Feature extraction</topic><topic>Human communication</topic><topic>Multimedia</topic><topic>Multimedia Information Systems</topic><topic>Nervous system</topic><topic>Special Purpose and Application-Based Systems</topic><topic>Speech</topic><topic>Speech recognition</topic><topic>Trends</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kaur, Kamaldeep</creatorcontrib><creatorcontrib>Singh, Parminder</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI商业信息数据库</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer science database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>One Business (ProQuest)</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kaur, Kamaldeep</au><au>Singh, Parminder</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Trends in speech emotion recognition: a comprehensive survey</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2023-08-01</date><risdate>2023</risdate><volume>82</volume><issue>19</issue><spage>29307</spage><epage>29351</epage><pages>29307-29351</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>Among the other modes of communication, such as text, body language, facial expressions, and so on, human beings employ speech as the most common. It contains a great deal of information, including the speaker’s feelings. Detecting the speaker’s emotions from his or her speech has shown to be quite useful in a variety of real-world applications. The dataset development, feature extraction, feature selection/dimensionality reduction, and classification are the four primary processes in the Speech Emotion Recognition process. In this context, more than 70 studies are thoroughly examined in terms of their databases, emotions, features extracted, and classifiers employed. The databases, characteristics, extraction and classification methods, as well as the results, are all thoroughly examined. The study also includes a comparative analysis of these research papers.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-023-14656-y</doi><tpages>45</tpages><orcidid>https://orcid.org/0000-0002-3542-1214</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1380-7501 |
ispartof | Multimedia tools and applications, 2023-08, Vol.82 (19), p.29307-29351 |
issn | 1380-7501 1573-7721 |
language | eng |
recordid | cdi_proquest_journals_2840671229 |
source | ABI/INFORM Global; Springer Link |
subjects | Classification Communication Computer Communication Networks Computer Science Data Structures and Information Theory Emotion recognition Emotions Feature extraction Human communication Multimedia Multimedia Information Systems Nervous system Special Purpose and Application-Based Systems Speech Speech recognition Trends |
title | Trends in speech emotion recognition: a comprehensive survey |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T12%3A49%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Trends%20in%20speech%20emotion%20recognition:%20a%20comprehensive%20survey&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Kaur,%20Kamaldeep&rft.date=2023-08-01&rft.volume=82&rft.issue=19&rft.spage=29307&rft.epage=29351&rft.pages=29307-29351&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-023-14656-y&rft_dat=%3Cproquest_cross%3E2840671229%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c319t-58bc841d56cd2c44f11adc81d229538152106c4532d27769aebe924a9810fa613%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2840671229&rft_id=info:pmid/&rfr_iscdi=true |