Loading…
Deep Learning Model for COVID-19 Sentiment Analysis on Twitter
The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the...
Saved in:
Published in: | New generation computing 2023, Vol.41 (2), p.189-212 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13 |
---|---|
cites | cdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13 |
container_end_page | 212 |
container_issue | 2 |
container_start_page | 189 |
container_title | New generation computing |
container_volume | 41 |
creator | Contreras Hernández, Salvador Tzili Cruz, María Patricia Espínola Sánchez, José Martín Pérez Tzili, Angélica |
description | The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico. |
doi_str_mv | 10.1007/s00354-023-00209-2 |
format | article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10010651</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2820019525</sourcerecordid><originalsourceid>FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</originalsourceid><addsrcrecordid>eNp9kTlPAzEQhS0EgnD8AQq0Eg2NwR7Ha28DihIuKSgFgdbaYzYs2qyDvQHx73FICEdBYxfzzZt58wg55OyUM6bOPGNCdikDQRkDllDYIB2uNVDFpNwkHQZaUxELuUN2vX8OeCy6sE12hAJIuGYdcj5AnEVDTF1TNZPozhZYR6V1UX_0eDugPInusWmraXiiXpPW777ykW2i8VvVtuj2yVaZ1h4PVv8eebi6HPdv6HB0fdvvDWneVbKlCLwrdaE4iwuu8qxUWQ5S8kxhBghSx0LpnJVJDDIpteIiS4BjIWJUECMXe-RiqTubZ1Ms8rCOS2szc9U0de_GppX5XWmqJzOxryYcKgyVC4WTlYKzL3P0rZlWPse6Thu0c29AQ0ATCTKgx3_QZzt3wfyC4pqr4CkJFCyp3FnvHZbrbThbjFVmmY8J-ZjPfAyEpqOfPtYtX4EEQCwBH0rNBN337H9kPwAE3Jh1</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2818174759</pqid></control><display><type>article</type><title>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</title><source>Springer Link</source><creator>Contreras Hernández, Salvador ; Tzili Cruz, María Patricia ; Espínola Sánchez, José Martín ; Pérez Tzili, Angélica</creator><creatorcontrib>Contreras Hernández, Salvador ; Tzili Cruz, María Patricia ; Espínola Sánchez, José Martín ; Pérez Tzili, Angélica</creatorcontrib><description>The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.</description><identifier>ISSN: 0288-3635</identifier><identifier>EISSN: 1882-7055</identifier><identifier>DOI: 10.1007/s00354-023-00209-2</identifier><identifier>PMID: 37229180</identifier><language>eng</language><publisher>Tokyo: Springer Japan</publisher><subject>Artificial Intelligence ; Computer Hardware ; Computer Science ; Computer Systems Organization and Communication Networks ; Coronaviruses ; COVID-19 ; Data mining ; Datasets ; Decision trees ; Geographical distribution ; Machine learning ; Neural networks ; Sentiment analysis ; Social networks ; Software Engineering/Programming and Operating Systems ; Spanish language ; Support vector machines ; Transformers</subject><ispartof>New generation computing, 2023, Vol.41 (2), p.189-212</ispartof><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</citedby><cites>FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</cites><orcidid>0000-0003-2385-284X ; 0000-0002-7739-8657 ; 0000-0002-6525-770X ; 0000-0002-7631-5565</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37229180$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Contreras Hernández, Salvador</creatorcontrib><creatorcontrib>Tzili Cruz, María Patricia</creatorcontrib><creatorcontrib>Espínola Sánchez, José Martín</creatorcontrib><creatorcontrib>Pérez Tzili, Angélica</creatorcontrib><title>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</title><title>New generation computing</title><addtitle>New Gener. Comput</addtitle><addtitle>New Gener Comput</addtitle><description>The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.</description><subject>Artificial Intelligence</subject><subject>Computer Hardware</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Coronaviruses</subject><subject>COVID-19</subject><subject>Data mining</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Geographical distribution</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Sentiment analysis</subject><subject>Social networks</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Spanish language</subject><subject>Support vector machines</subject><subject>Transformers</subject><issn>0288-3635</issn><issn>1882-7055</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kTlPAzEQhS0EgnD8AQq0Eg2NwR7Ha28DihIuKSgFgdbaYzYs2qyDvQHx73FICEdBYxfzzZt58wg55OyUM6bOPGNCdikDQRkDllDYIB2uNVDFpNwkHQZaUxELuUN2vX8OeCy6sE12hAJIuGYdcj5AnEVDTF1TNZPozhZYR6V1UX_0eDugPInusWmraXiiXpPW777ykW2i8VvVtuj2yVaZ1h4PVv8eebi6HPdv6HB0fdvvDWneVbKlCLwrdaE4iwuu8qxUWQ5S8kxhBghSx0LpnJVJDDIpteIiS4BjIWJUECMXe-RiqTubZ1Ms8rCOS2szc9U0de_GppX5XWmqJzOxryYcKgyVC4WTlYKzL3P0rZlWPse6Thu0c29AQ0ATCTKgx3_QZzt3wfyC4pqr4CkJFCyp3FnvHZbrbThbjFVmmY8J-ZjPfAyEpqOfPtYtX4EEQCwBH0rNBN337H9kPwAE3Jh1</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Contreras Hernández, Salvador</creator><creator>Tzili Cruz, María Patricia</creator><creator>Espínola Sánchez, José Martín</creator><creator>Pérez Tzili, Angélica</creator><general>Springer Japan</general><general>Springer Nature B.V</general><general>Ohmsha</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-2385-284X</orcidid><orcidid>https://orcid.org/0000-0002-7739-8657</orcidid><orcidid>https://orcid.org/0000-0002-6525-770X</orcidid><orcidid>https://orcid.org/0000-0002-7631-5565</orcidid></search><sort><creationdate>2023</creationdate><title>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</title><author>Contreras Hernández, Salvador ; Tzili Cruz, María Patricia ; Espínola Sánchez, José Martín ; Pérez Tzili, Angélica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial Intelligence</topic><topic>Computer Hardware</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Coronaviruses</topic><topic>COVID-19</topic><topic>Data mining</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Geographical distribution</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Sentiment analysis</topic><topic>Social networks</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Spanish language</topic><topic>Support vector machines</topic><topic>Transformers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Contreras Hernández, Salvador</creatorcontrib><creatorcontrib>Tzili Cruz, María Patricia</creatorcontrib><creatorcontrib>Espínola Sánchez, José Martín</creatorcontrib><creatorcontrib>Pérez Tzili, Angélica</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>New generation computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Contreras Hernández, Salvador</au><au>Tzili Cruz, María Patricia</au><au>Espínola Sánchez, José Martín</au><au>Pérez Tzili, Angélica</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</atitle><jtitle>New generation computing</jtitle><stitle>New Gener. Comput</stitle><addtitle>New Gener Comput</addtitle><date>2023</date><risdate>2023</risdate><volume>41</volume><issue>2</issue><spage>189</spage><epage>212</epage><pages>189-212</pages><issn>0288-3635</issn><eissn>1882-7055</eissn><abstract>The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.</abstract><cop>Tokyo</cop><pub>Springer Japan</pub><pmid>37229180</pmid><doi>10.1007/s00354-023-00209-2</doi><tpages>24</tpages><orcidid>https://orcid.org/0000-0003-2385-284X</orcidid><orcidid>https://orcid.org/0000-0002-7739-8657</orcidid><orcidid>https://orcid.org/0000-0002-6525-770X</orcidid><orcidid>https://orcid.org/0000-0002-7631-5565</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0288-3635 |
ispartof | New generation computing, 2023, Vol.41 (2), p.189-212 |
issn | 0288-3635 1882-7055 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10010651 |
source | Springer Link |
subjects | Artificial Intelligence Computer Hardware Computer Science Computer Systems Organization and Communication Networks Coronaviruses COVID-19 Data mining Datasets Decision trees Geographical distribution Machine learning Neural networks Sentiment analysis Social networks Software Engineering/Programming and Operating Systems Spanish language Support vector machines Transformers |
title | Deep Learning Model for COVID-19 Sentiment Analysis on Twitter |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-23T18%3A13%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Deep%20Learning%20Model%20for%20COVID-19%20Sentiment%20Analysis%20on%20Twitter&rft.jtitle=New%20generation%20computing&rft.au=Contreras%20Hern%C3%A1ndez,%20Salvador&rft.date=2023&rft.volume=41&rft.issue=2&rft.spage=189&rft.epage=212&rft.pages=189-212&rft.issn=0288-3635&rft.eissn=1882-7055&rft_id=info:doi/10.1007/s00354-023-00209-2&rft_dat=%3Cproquest_pubme%3E2820019525%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2818174759&rft_id=info:pmid/37229180&rfr_iscdi=true |