Loading…

Multilingual aspect clustering for sentiment analysis

In the last few years, there has been growing interest in aspect-based sentiment analysis, which deals with extracting, clustering, and rating the overall opinion about the features of the entity being evaluated. Techniques for aspect extraction can produce an undesirably large number of aspects — w...

Full description

Saved in:
Bibliographic Details
Published in:Knowledge-based systems 2020-03, Vol.192, p.105339, Article 105339
Main Authors: Pessutto, Lucas Rafael Costella, Vargas, Danny Suarez, Moreira, Viviane P.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c334t-355be610d0c09338cc74e2123074753272dd73121f0184f2a7f741856d9c1c9a3
cites cdi_FETCH-LOGICAL-c334t-355be610d0c09338cc74e2123074753272dd73121f0184f2a7f741856d9c1c9a3
container_end_page
container_issue
container_start_page 105339
container_title Knowledge-based systems
container_volume 192
creator Pessutto, Lucas Rafael Costella
Vargas, Danny Suarez
Moreira, Viviane P.
description In the last few years, there has been growing interest in aspect-based sentiment analysis, which deals with extracting, clustering, and rating the overall opinion about the features of the entity being evaluated. Techniques for aspect extraction can produce an undesirably large number of aspects — with many of those relating to the same product feature. Hence, aspect clustering becomes necessary. Current solutions for aspect clustering are monolingual, but in many practical situations, reviews for a given entity are available in several languages, calling for multilingual integration. In this article, we address the novel task of multilingual aspect clustering, which aims at grouping semantically related aspects extracted from reviews written in several languages. Our method is unsupervised and relies on the contextual information of the aspects, which is represented by word embeddings. This representation allied with a suitable similarity measure allows clustering related aspects. Our experiments on two datasets with five languages each showed that our unsupervised clustering technique achieves results that outperform monolingual baselines adapted to work with multilingual data. We also show the benefits of the multilingual approach compared to using languages in isolation.
doi_str_mv 10.1016/j.knosys.2019.105339
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2427545265</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705119306070</els_id><sourcerecordid>2427545265</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-355be610d0c09338cc74e2123074753272dd73121f0184f2a7f741856d9c1c9a3</originalsourceid><addsrcrecordid>eNp9kE9LxDAQxYMouK5-Aw8Fz11n8qdpL4IsugorXvQcYppKarddM62w394s9exlBh7vPWZ-jF0jrBCwuG1XX_1AB1pxwCpJSojqhC2w1DzXEqpTtoBKQa5B4Tm7IGoBgHMsF0y9TN0YutB_TrbLLO29GzPXTTT6mMSsGWJGvh_DLo3M9rY7UKBLdtbYjvzV316y98eHt_VTvn3dPK_vt7kTQo65UOrDFwg1OKiEKJ3T0nPkArTUSnDN61oL5NgAlrLhVjdaYqmKunLoKiuW7Gbu3cfhe_I0mnaYYjqCDJdcK6l4oZJLzi4XB6LoG7OPYWfjwSCYIyDTmhmQOQIyM6AUu5tjPn3wE3w05ILvna9DTBRMPYT_C34B2ohvSA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2427545265</pqid></control><display><type>article</type><title>Multilingual aspect clustering for sentiment analysis</title><source>Library &amp; Information Science Abstracts (LISA)</source><source>ScienceDirect Journals</source><creator>Pessutto, Lucas Rafael Costella ; Vargas, Danny Suarez ; Moreira, Viviane P.</creator><creatorcontrib>Pessutto, Lucas Rafael Costella ; Vargas, Danny Suarez ; Moreira, Viviane P.</creatorcontrib><description>In the last few years, there has been growing interest in aspect-based sentiment analysis, which deals with extracting, clustering, and rating the overall opinion about the features of the entity being evaluated. Techniques for aspect extraction can produce an undesirably large number of aspects — with many of those relating to the same product feature. Hence, aspect clustering becomes necessary. Current solutions for aspect clustering are monolingual, but in many practical situations, reviews for a given entity are available in several languages, calling for multilingual integration. In this article, we address the novel task of multilingual aspect clustering, which aims at grouping semantically related aspects extracted from reviews written in several languages. Our method is unsupervised and relies on the contextual information of the aspects, which is represented by word embeddings. This representation allied with a suitable similarity measure allows clustering related aspects. Our experiments on two datasets with five languages each showed that our unsupervised clustering technique achieves results that outperform monolingual baselines adapted to work with multilingual data. We also show the benefits of the multilingual approach compared to using languages in isolation.</description><identifier>ISSN: 0950-7051</identifier><identifier>EISSN: 1872-7409</identifier><identifier>DOI: 10.1016/j.knosys.2019.105339</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Aspect-based sentiment analysis ; Clustering ; Data mining ; Feature extraction ; Languages ; Multilingual aspect clustering ; Multilingualism ; Product specifications ; Sentiment analysis ; Unsupervised learning ; Word embeddings</subject><ispartof>Knowledge-based systems, 2020-03, Vol.192, p.105339, Article 105339</ispartof><rights>2019 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Mar 15, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c334t-355be610d0c09338cc74e2123074753272dd73121f0184f2a7f741856d9c1c9a3</citedby><cites>FETCH-LOGICAL-c334t-355be610d0c09338cc74e2123074753272dd73121f0184f2a7f741856d9c1c9a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,778,782,27911,27912,34122</link.rule.ids></links><search><creatorcontrib>Pessutto, Lucas Rafael Costella</creatorcontrib><creatorcontrib>Vargas, Danny Suarez</creatorcontrib><creatorcontrib>Moreira, Viviane P.</creatorcontrib><title>Multilingual aspect clustering for sentiment analysis</title><title>Knowledge-based systems</title><description>In the last few years, there has been growing interest in aspect-based sentiment analysis, which deals with extracting, clustering, and rating the overall opinion about the features of the entity being evaluated. Techniques for aspect extraction can produce an undesirably large number of aspects — with many of those relating to the same product feature. Hence, aspect clustering becomes necessary. Current solutions for aspect clustering are monolingual, but in many practical situations, reviews for a given entity are available in several languages, calling for multilingual integration. In this article, we address the novel task of multilingual aspect clustering, which aims at grouping semantically related aspects extracted from reviews written in several languages. Our method is unsupervised and relies on the contextual information of the aspects, which is represented by word embeddings. This representation allied with a suitable similarity measure allows clustering related aspects. Our experiments on two datasets with five languages each showed that our unsupervised clustering technique achieves results that outperform monolingual baselines adapted to work with multilingual data. We also show the benefits of the multilingual approach compared to using languages in isolation.</description><subject>Aspect-based sentiment analysis</subject><subject>Clustering</subject><subject>Data mining</subject><subject>Feature extraction</subject><subject>Languages</subject><subject>Multilingual aspect clustering</subject><subject>Multilingualism</subject><subject>Product specifications</subject><subject>Sentiment analysis</subject><subject>Unsupervised learning</subject><subject>Word embeddings</subject><issn>0950-7051</issn><issn>1872-7409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>F2A</sourceid><recordid>eNp9kE9LxDAQxYMouK5-Aw8Fz11n8qdpL4IsugorXvQcYppKarddM62w394s9exlBh7vPWZ-jF0jrBCwuG1XX_1AB1pxwCpJSojqhC2w1DzXEqpTtoBKQa5B4Tm7IGoBgHMsF0y9TN0YutB_TrbLLO29GzPXTTT6mMSsGWJGvh_DLo3M9rY7UKBLdtbYjvzV316y98eHt_VTvn3dPK_vt7kTQo65UOrDFwg1OKiEKJ3T0nPkArTUSnDN61oL5NgAlrLhVjdaYqmKunLoKiuW7Gbu3cfhe_I0mnaYYjqCDJdcK6l4oZJLzi4XB6LoG7OPYWfjwSCYIyDTmhmQOQIyM6AUu5tjPn3wE3w05ILvna9DTBRMPYT_C34B2ohvSA</recordid><startdate>20200315</startdate><enddate>20200315</enddate><creator>Pessutto, Lucas Rafael Costella</creator><creator>Vargas, Danny Suarez</creator><creator>Moreira, Viviane P.</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20200315</creationdate><title>Multilingual aspect clustering for sentiment analysis</title><author>Pessutto, Lucas Rafael Costella ; Vargas, Danny Suarez ; Moreira, Viviane P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-355be610d0c09338cc74e2123074753272dd73121f0184f2a7f741856d9c1c9a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Aspect-based sentiment analysis</topic><topic>Clustering</topic><topic>Data mining</topic><topic>Feature extraction</topic><topic>Languages</topic><topic>Multilingual aspect clustering</topic><topic>Multilingualism</topic><topic>Product specifications</topic><topic>Sentiment analysis</topic><topic>Unsupervised learning</topic><topic>Word embeddings</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Pessutto, Lucas Rafael Costella</creatorcontrib><creatorcontrib>Vargas, Danny Suarez</creatorcontrib><creatorcontrib>Moreira, Viviane P.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Pessutto, Lucas Rafael Costella</au><au>Vargas, Danny Suarez</au><au>Moreira, Viviane P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multilingual aspect clustering for sentiment analysis</atitle><jtitle>Knowledge-based systems</jtitle><date>2020-03-15</date><risdate>2020</risdate><volume>192</volume><spage>105339</spage><pages>105339-</pages><artnum>105339</artnum><issn>0950-7051</issn><eissn>1872-7409</eissn><abstract>In the last few years, there has been growing interest in aspect-based sentiment analysis, which deals with extracting, clustering, and rating the overall opinion about the features of the entity being evaluated. Techniques for aspect extraction can produce an undesirably large number of aspects — with many of those relating to the same product feature. Hence, aspect clustering becomes necessary. Current solutions for aspect clustering are monolingual, but in many practical situations, reviews for a given entity are available in several languages, calling for multilingual integration. In this article, we address the novel task of multilingual aspect clustering, which aims at grouping semantically related aspects extracted from reviews written in several languages. Our method is unsupervised and relies on the contextual information of the aspects, which is represented by word embeddings. This representation allied with a suitable similarity measure allows clustering related aspects. Our experiments on two datasets with five languages each showed that our unsupervised clustering technique achieves results that outperform monolingual baselines adapted to work with multilingual data. We also show the benefits of the multilingual approach compared to using languages in isolation.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2019.105339</doi></addata></record>
fulltext fulltext
identifier ISSN: 0950-7051
ispartof Knowledge-based systems, 2020-03, Vol.192, p.105339, Article 105339
issn 0950-7051
1872-7409
language eng
recordid cdi_proquest_journals_2427545265
source Library & Information Science Abstracts (LISA); ScienceDirect Journals
subjects Aspect-based sentiment analysis
Clustering
Data mining
Feature extraction
Languages
Multilingual aspect clustering
Multilingualism
Product specifications
Sentiment analysis
Unsupervised learning
Word embeddings
title Multilingual aspect clustering for sentiment analysis
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T02%3A25%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multilingual%20aspect%20clustering%20for%20sentiment%20analysis&rft.jtitle=Knowledge-based%20systems&rft.au=Pessutto,%20Lucas%20Rafael%20Costella&rft.date=2020-03-15&rft.volume=192&rft.spage=105339&rft.pages=105339-&rft.artnum=105339&rft.issn=0950-7051&rft.eissn=1872-7409&rft_id=info:doi/10.1016/j.knosys.2019.105339&rft_dat=%3Cproquest_cross%3E2427545265%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c334t-355be610d0c09338cc74e2123074753272dd73121f0184f2a7f741856d9c1c9a3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2427545265&rft_id=info:pmid/&rfr_iscdi=true