Loading…

Deep Learning Model for COVID-19 Sentiment Analysis on Twitter

The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the...

Full description

Saved in:
Bibliographic Details
Published in:New generation computing 2023, Vol.41 (2), p.189-212
Main Authors: Contreras Hernández, Salvador, Tzili Cruz, María Patricia, Espínola Sánchez, José Martín, Pérez Tzili, Angélica
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13
cites cdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13
container_end_page 212
container_issue 2
container_start_page 189
container_title New generation computing
container_volume 41
creator Contreras Hernández, Salvador
Tzili Cruz, María Patricia
Espínola Sánchez, José Martín
Pérez Tzili, Angélica
description The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.
doi_str_mv 10.1007/s00354-023-00209-2
format article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10010651</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2820019525</sourcerecordid><originalsourceid>FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</originalsourceid><addsrcrecordid>eNp9kTlPAzEQhS0EgnD8AQq0Eg2NwR7Ha28DihIuKSgFgdbaYzYs2qyDvQHx73FICEdBYxfzzZt58wg55OyUM6bOPGNCdikDQRkDllDYIB2uNVDFpNwkHQZaUxELuUN2vX8OeCy6sE12hAJIuGYdcj5AnEVDTF1TNZPozhZYR6V1UX_0eDugPInusWmraXiiXpPW777ykW2i8VvVtuj2yVaZ1h4PVv8eebi6HPdv6HB0fdvvDWneVbKlCLwrdaE4iwuu8qxUWQ5S8kxhBghSx0LpnJVJDDIpteIiS4BjIWJUECMXe-RiqTubZ1Ms8rCOS2szc9U0de_GppX5XWmqJzOxryYcKgyVC4WTlYKzL3P0rZlWPse6Thu0c29AQ0ATCTKgx3_QZzt3wfyC4pqr4CkJFCyp3FnvHZbrbThbjFVmmY8J-ZjPfAyEpqOfPtYtX4EEQCwBH0rNBN337H9kPwAE3Jh1</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2818174759</pqid></control><display><type>article</type><title>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</title><source>Springer Link</source><creator>Contreras Hernández, Salvador ; Tzili Cruz, María Patricia ; Espínola Sánchez, José Martín ; Pérez Tzili, Angélica</creator><creatorcontrib>Contreras Hernández, Salvador ; Tzili Cruz, María Patricia ; Espínola Sánchez, José Martín ; Pérez Tzili, Angélica</creatorcontrib><description>The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.</description><identifier>ISSN: 0288-3635</identifier><identifier>EISSN: 1882-7055</identifier><identifier>DOI: 10.1007/s00354-023-00209-2</identifier><identifier>PMID: 37229180</identifier><language>eng</language><publisher>Tokyo: Springer Japan</publisher><subject>Artificial Intelligence ; Computer Hardware ; Computer Science ; Computer Systems Organization and Communication Networks ; Coronaviruses ; COVID-19 ; Data mining ; Datasets ; Decision trees ; Geographical distribution ; Machine learning ; Neural networks ; Sentiment analysis ; Social networks ; Software Engineering/Programming and Operating Systems ; Spanish language ; Support vector machines ; Transformers</subject><ispartof>New generation computing, 2023, Vol.41 (2), p.189-212</ispartof><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>The Author(s), under exclusive licence to The Japanese Society for Artificial Intelligence and Springer Nature Japan KK, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</citedby><cites>FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</cites><orcidid>0000-0003-2385-284X ; 0000-0002-7739-8657 ; 0000-0002-6525-770X ; 0000-0002-7631-5565</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37229180$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Contreras Hernández, Salvador</creatorcontrib><creatorcontrib>Tzili Cruz, María Patricia</creatorcontrib><creatorcontrib>Espínola Sánchez, José Martín</creatorcontrib><creatorcontrib>Pérez Tzili, Angélica</creatorcontrib><title>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</title><title>New generation computing</title><addtitle>New Gener. Comput</addtitle><addtitle>New Gener Comput</addtitle><description>The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.</description><subject>Artificial Intelligence</subject><subject>Computer Hardware</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Coronaviruses</subject><subject>COVID-19</subject><subject>Data mining</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Geographical distribution</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Sentiment analysis</subject><subject>Social networks</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Spanish language</subject><subject>Support vector machines</subject><subject>Transformers</subject><issn>0288-3635</issn><issn>1882-7055</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kTlPAzEQhS0EgnD8AQq0Eg2NwR7Ha28DihIuKSgFgdbaYzYs2qyDvQHx73FICEdBYxfzzZt58wg55OyUM6bOPGNCdikDQRkDllDYIB2uNVDFpNwkHQZaUxELuUN2vX8OeCy6sE12hAJIuGYdcj5AnEVDTF1TNZPozhZYR6V1UX_0eDugPInusWmraXiiXpPW777ykW2i8VvVtuj2yVaZ1h4PVv8eebi6HPdv6HB0fdvvDWneVbKlCLwrdaE4iwuu8qxUWQ5S8kxhBghSx0LpnJVJDDIpteIiS4BjIWJUECMXe-RiqTubZ1Ms8rCOS2szc9U0de_GppX5XWmqJzOxryYcKgyVC4WTlYKzL3P0rZlWPse6Thu0c29AQ0ATCTKgx3_QZzt3wfyC4pqr4CkJFCyp3FnvHZbrbThbjFVmmY8J-ZjPfAyEpqOfPtYtX4EEQCwBH0rNBN337H9kPwAE3Jh1</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Contreras Hernández, Salvador</creator><creator>Tzili Cruz, María Patricia</creator><creator>Espínola Sánchez, José Martín</creator><creator>Pérez Tzili, Angélica</creator><general>Springer Japan</general><general>Springer Nature B.V</general><general>Ohmsha</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-2385-284X</orcidid><orcidid>https://orcid.org/0000-0002-7739-8657</orcidid><orcidid>https://orcid.org/0000-0002-6525-770X</orcidid><orcidid>https://orcid.org/0000-0002-7631-5565</orcidid></search><sort><creationdate>2023</creationdate><title>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</title><author>Contreras Hernández, Salvador ; Tzili Cruz, María Patricia ; Espínola Sánchez, José Martín ; Pérez Tzili, Angélica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial Intelligence</topic><topic>Computer Hardware</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Coronaviruses</topic><topic>COVID-19</topic><topic>Data mining</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Geographical distribution</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Sentiment analysis</topic><topic>Social networks</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Spanish language</topic><topic>Support vector machines</topic><topic>Transformers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Contreras Hernández, Salvador</creatorcontrib><creatorcontrib>Tzili Cruz, María Patricia</creatorcontrib><creatorcontrib>Espínola Sánchez, José Martín</creatorcontrib><creatorcontrib>Pérez Tzili, Angélica</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>New generation computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Contreras Hernández, Salvador</au><au>Tzili Cruz, María Patricia</au><au>Espínola Sánchez, José Martín</au><au>Pérez Tzili, Angélica</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Deep Learning Model for COVID-19 Sentiment Analysis on Twitter</atitle><jtitle>New generation computing</jtitle><stitle>New Gener. Comput</stitle><addtitle>New Gener Comput</addtitle><date>2023</date><risdate>2023</risdate><volume>41</volume><issue>2</issue><spage>189</spage><epage>212</epage><pages>189-212</pages><issn>0288-3635</issn><eissn>1882-7055</eissn><abstract>The COVID-19 pandemic impacted the mood of the people, and this was evident on social networks. These common user publications are a source of information to measure the population's opinion on social phenomena. In particular, the Twitter network represents a resource of great value due to the amount of information, the geographical distribution of the publications and the openness to dispose of them. This work presents a study on the feelings of the population in Mexico during one of the waves that produced the most contagion and deaths in this country. A mixed, semi-supervised approach was used, with a lexical-based data labeling technique to later bring these data to a pre-trained model of Transformers completely in Spanish. Two Spanish-language models were trained by adding to the Transformers neural network the adjustment for the sentiment analysis task specifically on COVID-19. In addition, ten other multilanguage Transformer models including the Spanish language were trained with the same data set and parameters to compare their performance. In addition, other classifiers with the same data set were used for training and testing, such as Support Vector Machines, Naive Bayes, Logistic Regression, and Decision Trees. These performances were compared with the exclusive model in Spanish based on Transformers, which had higher precision. Finally, this model was used, developed exclusively based on the Spanish language, with new data, to measure the sentiment about COVID-19 of the Twitter community in Mexico.</abstract><cop>Tokyo</cop><pub>Springer Japan</pub><pmid>37229180</pmid><doi>10.1007/s00354-023-00209-2</doi><tpages>24</tpages><orcidid>https://orcid.org/0000-0003-2385-284X</orcidid><orcidid>https://orcid.org/0000-0002-7739-8657</orcidid><orcidid>https://orcid.org/0000-0002-6525-770X</orcidid><orcidid>https://orcid.org/0000-0002-7631-5565</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0288-3635
ispartof New generation computing, 2023, Vol.41 (2), p.189-212
issn 0288-3635
1882-7055
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_10010651
source Springer Link
subjects Artificial Intelligence
Computer Hardware
Computer Science
Computer Systems Organization and Communication Networks
Coronaviruses
COVID-19
Data mining
Datasets
Decision trees
Geographical distribution
Machine learning
Neural networks
Sentiment analysis
Social networks
Software Engineering/Programming and Operating Systems
Spanish language
Support vector machines
Transformers
title Deep Learning Model for COVID-19 Sentiment Analysis on Twitter
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-23T18%3A13%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Deep%20Learning%20Model%20for%20COVID-19%20Sentiment%20Analysis%20on%20Twitter&rft.jtitle=New%20generation%20computing&rft.au=Contreras%20Hern%C3%A1ndez,%20Salvador&rft.date=2023&rft.volume=41&rft.issue=2&rft.spage=189&rft.epage=212&rft.pages=189-212&rft.issn=0288-3635&rft.eissn=1882-7055&rft_id=info:doi/10.1007/s00354-023-00209-2&rft_dat=%3Cproquest_pubme%3E2820019525%3C/proquest_pubme%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c475t-e21458d7106d17cbf7bc2551b7eb2e2586378c0f96259f8713b921ed36e726e13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2818174759&rft_id=info:pmid/37229180&rfr_iscdi=true