Loading…

Block-segmentation vectors for arousal prediction using semi-supervised learning

To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-sup...

Full description

Saved in:
Bibliographic Details
Published in:Applied soft computing 2023-07, Vol.142, p.110327, Article 110327
Main Authors: Odaka, Yuki, Kaneiwa, Ken
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83
container_end_page
container_issue
container_start_page 110327
container_title Applied soft computing
container_volume 142
creator Odaka, Yuki
Kaneiwa, Ken
description To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent. •Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts.
doi_str_mv 10.1016/j.asoc.2023.110327
format article
fullrecord <record><control><sourceid>elsevier_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1016_j_asoc_2023_110327</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1568494623003459</els_id><sourcerecordid>S1568494623003459</sourcerecordid><originalsourceid>FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83</originalsourceid><addsrcrecordid>eNp9kMFKAzEQhoMoWKsv4GlfIGuS7WYT8KJFq1DQg55DNpktqdtNyewWfHtT69nTDPx8w_wfIbeclZxxebctLUZXCiaqknNWieaMzLhqBNVS8fO811LRhV7IS3KFuGUZ0kLNyPtjH90XRdjsYBjtGOJQHMCNMWHRxVTYFCe0fbFP4IP7jScMw6ZA2AWK0x7SISD4ogebhhxck4vO9gg3f3NOPp-fPpYvdP22el0-rKkTNR-pa1UtvQDWNlZqCdx5pqzlvhMgGTRC15X2DgBEy3Ih3viqE6KtpNc1B1XNiTjddSkiJujMPoWdTd-GM3N0Yrbm6MQcnZiTkwzdnyDInx0CJIMuwOByt5RLGx_Df_gPhohslA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Block-segmentation vectors for arousal prediction using semi-supervised learning</title><source>ScienceDirect Freedom Collection</source><creator>Odaka, Yuki ; Kaneiwa, Ken</creator><creatorcontrib>Odaka, Yuki ; Kaneiwa, Ken</creatorcontrib><description>To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent. •Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts.</description><identifier>ISSN: 1568-4946</identifier><identifier>EISSN: 1872-9681</identifier><identifier>DOI: 10.1016/j.asoc.2023.110327</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Arousal ; Semi-supervised learning ; Sentiment analysis ; Word embedding</subject><ispartof>Applied soft computing, 2023-07, Vol.142, p.110327, Article 110327</ispartof><rights>2023 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Odaka, Yuki</creatorcontrib><creatorcontrib>Kaneiwa, Ken</creatorcontrib><title>Block-segmentation vectors for arousal prediction using semi-supervised learning</title><title>Applied soft computing</title><description>To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent. •Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts.</description><subject>Arousal</subject><subject>Semi-supervised learning</subject><subject>Sentiment analysis</subject><subject>Word embedding</subject><issn>1568-4946</issn><issn>1872-9681</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kMFKAzEQhoMoWKsv4GlfIGuS7WYT8KJFq1DQg55DNpktqdtNyewWfHtT69nTDPx8w_wfIbeclZxxebctLUZXCiaqknNWieaMzLhqBNVS8fO811LRhV7IS3KFuGUZ0kLNyPtjH90XRdjsYBjtGOJQHMCNMWHRxVTYFCe0fbFP4IP7jScMw6ZA2AWK0x7SISD4ogebhhxck4vO9gg3f3NOPp-fPpYvdP22el0-rKkTNR-pa1UtvQDWNlZqCdx5pqzlvhMgGTRC15X2DgBEy3Ih3viqE6KtpNc1B1XNiTjddSkiJujMPoWdTd-GM3N0Yrbm6MQcnZiTkwzdnyDInx0CJIMuwOByt5RLGx_Df_gPhohslA</recordid><startdate>202307</startdate><enddate>202307</enddate><creator>Odaka, Yuki</creator><creator>Kaneiwa, Ken</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>202307</creationdate><title>Block-segmentation vectors for arousal prediction using semi-supervised learning</title><author>Odaka, Yuki ; Kaneiwa, Ken</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Arousal</topic><topic>Semi-supervised learning</topic><topic>Sentiment analysis</topic><topic>Word embedding</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Odaka, Yuki</creatorcontrib><creatorcontrib>Kaneiwa, Ken</creatorcontrib><collection>CrossRef</collection><jtitle>Applied soft computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Odaka, Yuki</au><au>Kaneiwa, Ken</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Block-segmentation vectors for arousal prediction using semi-supervised learning</atitle><jtitle>Applied soft computing</jtitle><date>2023-07</date><risdate>2023</risdate><volume>142</volume><spage>110327</spage><pages>110327-</pages><artnum>110327</artnum><issn>1568-4946</issn><eissn>1872-9681</eissn><abstract>To handle emotional expressions in computer applications, Russell’s circumplex model has been useful for representing emotions according to valence and arousal. In SentiWordNet, the level of valence is automatically assigned to a large number of synsets (groups of synonyms in WordNet) using semi-supervised learning. However, when assigning the level of arousal, the existing method proposed for SentiWordNet reduces the accuracy of sentiment prediction. In this paper, we propose a block-segmentation vector for predicting the arousal levels of many synsets from a small number of labeled words using semi-supervised learning. We analyze the distribution of arousal and non-arousal words in a corpus of sentences by comparing it with the distribution of valence words. We address the problem that arousal level prediction fails when arousal and non-arousal words are mixed together in some sentences. To capture the features of such arousal and non-arousal words, we generate word vectors based on inverted indexes by block IDs, where the corpus is divided into blocks in the flow of sentences. In the evaluation experiment, we show that the results of arousal prediction with the block-segmentation vectors using semi-supervised learning outperform the results of the previous methods in SentiWordNet and SocialSent. •Proposing a block-segmentation vector to predict arousal levels for WordNet synsets.•Analyzing word distribution shows labeling arousal is more difficult than valence.•Feature-selected block-segmentation vectors improve arousal prediction accuracy.•Segmenting a corpus into blocks detects arousal levels in long sentence contexts.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.asoc.2023.110327</doi></addata></record>
fulltext fulltext
identifier ISSN: 1568-4946
ispartof Applied soft computing, 2023-07, Vol.142, p.110327, Article 110327
issn 1568-4946
1872-9681
language eng
recordid cdi_crossref_primary_10_1016_j_asoc_2023_110327
source ScienceDirect Freedom Collection
subjects Arousal
Semi-supervised learning
Sentiment analysis
Word embedding
title Block-segmentation vectors for arousal prediction using semi-supervised learning
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T23%3A12%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Block-segmentation%20vectors%20for%20arousal%20prediction%20using%20semi-supervised%20learning&rft.jtitle=Applied%20soft%20computing&rft.au=Odaka,%20Yuki&rft.date=2023-07&rft.volume=142&rft.spage=110327&rft.pages=110327-&rft.artnum=110327&rft.issn=1568-4946&rft.eissn=1872-9681&rft_id=info:doi/10.1016/j.asoc.2023.110327&rft_dat=%3Celsevier_cross%3ES1568494623003459%3C/elsevier_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c251t-cb856d2e0b7a696e1cd08aa1df2e60e729539dceee2b010317d3f22b36d951e83%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true