Loading…
Block-segmentation vectors for arousal prediction using semi-supervised learning
To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-sup...
Saved in:
Published in: | Applied soft computing 2023-07, Vol.142, p.110327, Article 110327 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83 |
container_end_page | |
container_issue | |
container_start_page | 110327 |
container_title | Applied soft computing |
container_volume | 142 |
creator | Odaka, Yuki Kaneiwa, Ken |
description | To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent.
•Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts. |
doi_str_mv | 10.1016/j.asoc.2023.110327 |
format | article |
fullrecord | <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_asoc_2023_110327</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1568494623003459</els_id><sourcerecordid>S1568494623003459</sourcerecordid><originalsourceid>FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83</originalsourceid><addsrcrecordid>eNp9kMFKAzEQhoMoWKsv4GlfIGuS7WYT8KJFq1DQg55DNpktqdtNyewWfHtT69nTDPx8w_wfIbeclZxxebctLUZXCiaqknNWieaMzLhqBNVS8fO811LRhV7IS3KFuGUZ0kLNyPtjH90XRdjsYBjtGOJQHMCNMWHRxVTYFCe0fbFP4IP7jScMw6ZA2AWK0x7SISD4ogebhhxck4vO9gg3f3NOPp-fPpYvdP22el0-rKkTNR-pa1UtvQDWNlZqCdx5pqzlvhMgGTRC15X2DgBEy3Ih3viqE6KtpNc1B1XNiTjddSkiJujMPoWdTd-GM3N0Yrbm6MQcnZiTkwzdnyDInx0CJIMuwOByt5RLGx_Df_gPhohslA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Block-segmentation vectors for arousal prediction using semi-supervised learning</title><source>ScienceDirect Freedom Collection</source><creator>Odaka, Yuki ; Kaneiwa, Ken</creator><creatorcontrib>Odaka, Yuki ; Kaneiwa, Ken</creatorcontrib><description>To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent.
•Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts.</description><identifier>ISSN: 1568-4946</identifier><identifier>EISSN: 1872-9681</identifier><identifier>DOI: 10.1016/j.asoc.2023.110327</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Arousal ; Semi-supervised learning ; Sentiment analysis ; Word embedding</subject><ispartof>Applied soft computing, 2023-07, Vol.142, p.110327, Article 110327</ispartof><rights>2023 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Odaka, Yuki</creatorcontrib><creatorcontrib>Kaneiwa, Ken</creatorcontrib><title>Block-segmentation vectors for arousal prediction using semi-supervised learning</title><title>Applied soft computing</title><description>To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent.
•Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts.</description><subject>Arousal</subject><subject>Semi-supervised learning</subject><subject>Sentiment analysis</subject><subject>Word embedding</subject><issn>1568-4946</issn><issn>1872-9681</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kMFKAzEQhoMoWKsv4GlfIGuS7WYT8KJFq1DQg55DNpktqdtNyewWfHtT69nTDPx8w_wfIbeclZxxebctLUZXCiaqknNWieaMzLhqBNVS8fO811LRhV7IS3KFuGUZ0kLNyPtjH90XRdjsYBjtGOJQHMCNMWHRxVTYFCe0fbFP4IP7jScMw6ZA2AWK0x7SISD4ogebhhxck4vO9gg3f3NOPp-fPpYvdP22el0-rKkTNR-pa1UtvQDWNlZqCdx5pqzlvhMgGTRC15X2DgBEy3Ih3viqE6KtpNc1B1XNiTjddSkiJujMPoWdTd-GM3N0Yrbm6MQcnZiTkwzdnyDInx0CJIMuwOByt5RLGx_Df_gPhohslA</recordid><startdate>202307</startdate><enddate>202307</enddate><creator>Odaka, Yuki</creator><creator>Kaneiwa, Ken</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>202307</creationdate><title>Block-segmentation vectors for arousal prediction using semi-supervised learning</title><author>Odaka, Yuki ; Kaneiwa, Ken</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Arousal</topic><topic>Semi-supervised learning</topic><topic>Sentiment analysis</topic><topic>Word embedding</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Odaka, Yuki</creatorcontrib><creatorcontrib>Kaneiwa, Ken</creatorcontrib><collection>CrossRef</collection><jtitle>Applied soft computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Odaka, Yuki</au><au>Kaneiwa, Ken</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Block-segmentation vectors for arousal prediction using semi-supervised learning</atitle><jtitle>Applied soft computing</jtitle><date>2023-07</date><risdate>2023</risdate><volume>142</volume><spage>110327</spage><pages>110327-</pages><artnum>110327</artnum><issn>1568-4946</issn><eissn>1872-9681</eissn><abstract>To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent.
•Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.asoc.2023.110327</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1568-4946 |
ispartof | Applied soft computing, 2023-07, Vol.142, p.110327, Article 110327 |
issn | 1568-4946 1872-9681 |
language | eng |
recordid | cdi_crossref_primary_10_1016_j_asoc_2023_110327 |
source | ScienceDirect Freedom Collection |
subjects | Arousal Semi-supervised learning Sentiment analysis Word embedding |
title | Block-segmentation vectors for arousal prediction using semi-supervised learning |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T23%3A12%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Block-segmentation%20vectors%20for%20arousal%20prediction%20using%20semi-supervised%20learning&rft.jtitle=Applied%20soft%20computing&rft.au=Odaka,%20Yuki&rft.date=2023-07&rft.volume=142&rft.spage=110327&rft.pages=110327-&rft.artnum=110327&rft.issn=1568-4946&rft.eissn=1872-9681&rft_id=info:doi/10.1016/j.asoc.2023.110327&rft_dat=%3Celsevier_cross%3ES1568494623003459%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |