Loading…

RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms

There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always st...

Full description

Saved in:
Bibliographic Details
Published in:Scientific reports 2024-11, Vol.14 (1), p.28870-15, Article 28870
Main Authors: Sennary, Hameda A., Abozaid, Ghada, Hemeida, Ashraf, Mikhaylov, Alexey
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553
container_end_page 15
container_issue 1
container_start_page 28870
container_title Scientific reports
container_volume 14
creator Sennary, Hameda A.
Abozaid, Ghada
Hemeida, Ashraf
Mikhaylov, Alexey
description There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.
doi_str_mv 10.1038/s41598-024-76632-2
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a</doaj_id><sourcerecordid>3131663756</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</originalsourceid><addsrcrecordid>eNp9kU-P0zAQxSMEEqtlvwAnS5wD9vhPkr1V3QKVKiFV5WxNnEmbksbFdne13x5vg4ATc5mR9d5vNH5F8V7wj4LL-lNUQjd1yUGVlTESSnhV3ABXugQJ8Pqf-W1xF-OR59LQKNHcFI_b1W67WO5WD2yx3a2Xm9U9e6BELg1-Yr5nB0x0z-KZyB1YeiJKkbUYqWPOT49-vLwIcWQTXcK1pScffjCcOnZCdxgmYiNhmIZpz3Dc-zCkwym-K970OEa6-91vi--fV7vl13Lz7ct6udiUTtQCStMpVKA11pgvM4Ya1MZUgFDLjhqoDAgOUvSKC5JYKUJsRQ8KG-oqreVtsZ65ncejPYfhhOHZehzs9cGHvcWQBjeSdRnVuFaCy_62g9Zx0fbUCaq7XgrMrA8z6xz8zwvFZI_-EvLt0UohRf75Spusglnlgo8xUP9nq-D2JS47x2VzXPYal4VskrMpZvG0p_AX_R_XLzPalwk</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3131663756</pqid></control><display><type>article</type><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><source>Publicly Available Content Database</source><source>Full-Text Journals in Chemistry (Open access)</source><source>PubMed Central</source><source>Coronavirus Research Database</source><source>Springer Nature - nature.com Journals - Fully Open Access</source><creator>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</creator><creatorcontrib>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</creatorcontrib><description>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-024-76632-2</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>639/705/1041 ; 639/705/1042 ; 639/705/117 ; 639/705/794 ; Cyberbullying ; Deep learning ; Hate speech ; Humanities and Social Sciences ; Learning algorithms ; multidisciplinary ; Neural networks ; Science ; Science (multidisciplinary) ; Social discrimination learning ; Social networks ; Social organization ; Toxicity</subject><ispartof>Scientific reports, 2024-11, Vol.14 (1), p.28870-15, Article 28870</ispartof><rights>The Author(s) 2024</rights><rights>Copyright Nature Publishing Group 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/3131663756?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/3131663756?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,25751,27922,27923,37010,38514,43893,44588,74182,74896</link.rule.ids></links><search><creatorcontrib>Sennary, Hameda A.</creatorcontrib><creatorcontrib>Abozaid, Ghada</creatorcontrib><creatorcontrib>Hemeida, Ashraf</creatorcontrib><creatorcontrib>Mikhaylov, Alexey</creatorcontrib><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><description>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</description><subject>639/705/1041</subject><subject>639/705/1042</subject><subject>639/705/117</subject><subject>639/705/794</subject><subject>Cyberbullying</subject><subject>Deep learning</subject><subject>Hate speech</subject><subject>Humanities and Social Sciences</subject><subject>Learning algorithms</subject><subject>multidisciplinary</subject><subject>Neural networks</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Social discrimination learning</subject><subject>Social networks</subject><subject>Social organization</subject><subject>Toxicity</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>COVID</sourceid><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNp9kU-P0zAQxSMEEqtlvwAnS5wD9vhPkr1V3QKVKiFV5WxNnEmbksbFdne13x5vg4ATc5mR9d5vNH5F8V7wj4LL-lNUQjd1yUGVlTESSnhV3ABXugQJ8Pqf-W1xF-OR59LQKNHcFI_b1W67WO5WD2yx3a2Xm9U9e6BELg1-Yr5nB0x0z-KZyB1YeiJKkbUYqWPOT49-vLwIcWQTXcK1pScffjCcOnZCdxgmYiNhmIZpz3Dc-zCkwym-K970OEa6-91vi--fV7vl13Lz7ct6udiUTtQCStMpVKA11pgvM4Ya1MZUgFDLjhqoDAgOUvSKC5JYKUJsRQ8KG-oqreVtsZ65ncejPYfhhOHZehzs9cGHvcWQBjeSdRnVuFaCy_62g9Zx0fbUCaq7XgrMrA8z6xz8zwvFZI_-EvLt0UohRf75Spusglnlgo8xUP9nq-D2JS47x2VzXPYal4VskrMpZvG0p_AX_R_XLzPalwk</recordid><startdate>20241121</startdate><enddate>20241121</enddate><creator>Sennary, Hameda A.</creator><creator>Abozaid, Ghada</creator><creator>Hemeida, Ashraf</creator><creator>Mikhaylov, Alexey</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><general>Nature Portfolio</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>COVID</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>DOA</scope></search><sort><creationdate>20241121</creationdate><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><author>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>639/705/1041</topic><topic>639/705/1042</topic><topic>639/705/117</topic><topic>639/705/794</topic><topic>Cyberbullying</topic><topic>Deep learning</topic><topic>Hate speech</topic><topic>Humanities and Social Sciences</topic><topic>Learning algorithms</topic><topic>multidisciplinary</topic><topic>Neural networks</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Social discrimination learning</topic><topic>Social networks</topic><topic>Social organization</topic><topic>Toxicity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sennary, Hameda A.</creatorcontrib><creatorcontrib>Abozaid, Ghada</creatorcontrib><creatorcontrib>Hemeida, Ashraf</creatorcontrib><creatorcontrib>Mikhaylov, Alexey</creatorcontrib><collection>SpringerOpen</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biological Sciences</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sennary, Hameda A.</au><au>Abozaid, Ghada</au><au>Hemeida, Ashraf</au><au>Mikhaylov, Alexey</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><date>2024-11-21</date><risdate>2024</risdate><volume>14</volume><issue>1</issue><spage>28870</spage><epage>15</epage><pages>28870-15</pages><artnum>28870</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><doi>10.1038/s41598-024-76632-2</doi><tpages>15</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2045-2322
ispartof Scientific reports, 2024-11, Vol.14 (1), p.28870-15, Article 28870
issn 2045-2322
2045-2322
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a
source Publicly Available Content Database; Full-Text Journals in Chemistry (Open access); PubMed Central; Coronavirus Research Database; Springer Nature - nature.com Journals - Fully Open Access
subjects 639/705/1041
639/705/1042
639/705/117
639/705/794
Cyberbullying
Deep learning
Hate speech
Humanities and Social Sciences
Learning algorithms
multidisciplinary
Neural networks
Science
Science (multidisciplinary)
Social discrimination learning
Social networks
Social organization
Toxicity
title RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T20%3A20%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RETRACTED%20ARTICLE:%20Detection%20of%20hate:%20speech%20tweets%20based%20convolutional%20neural%20network%20and%20machine%20learning%20algorithms&rft.jtitle=Scientific%20reports&rft.au=Sennary,%20Hameda%20A.&rft.date=2024-11-21&rft.volume=14&rft.issue=1&rft.spage=28870&rft.epage=15&rft.pages=28870-15&rft.artnum=28870&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-024-76632-2&rft_dat=%3Cproquest_doaj_%3E3131663756%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3131663756&rft_id=info:pmid/&rfr_iscdi=true