Loading…

RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms

There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always st...

Full description

Saved in:

Bibliographic Details
Published in:	Scientific reports 2024-11, Vol.14 (1), p.28870-15, Article 28870
Main Authors:	Sennary, Hameda A., Abozaid, Ghada, Hemeida, Ashraf, Mikhaylov, Alexey
Format:	Article
Language:	English
Subjects:	639/705/1041 639/705/1042 639/705/117 639/705/794 Cyberbullying Deep learning Hate speech Humanities and Social Sciences Learning algorithms multidisciplinary Neural networks Science Science (multidisciplinary) Social discrimination learning Social networks Social organization Toxicity
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites	cdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553
container_end_page	15
container_issue	1
container_start_page	28870
container_title	Scientific reports
container_volume	14
creator	Sennary, Hameda A. Abozaid, Ghada Hemeida, Ashraf Mikhaylov, Alexey
description	There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.
doi_str_mv	10.1038/s41598-024-76632-2
format	article
fullrecord	<record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a</doaj_id><sourcerecordid>3131663756</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</originalsourceid><addsrcrecordid>eNp9kU-P0zAQxSMEEqtlvwAnS5wD9vhPkr1V3QKVKiFV5WxNnEmbksbFdne13x5vg4ATc5mR9d5vNH5F8V7wj4LL-lNUQjd1yUGVlTESSnhV3ABXugQJ8Pqf-W1xF-OR59LQKNHcFI_b1W67WO5WD2yx3a2Xm9U9e6BELg1-Yr5nB0x0z-KZyB1YeiJKkbUYqWPOT49-vLwIcWQTXcK1pScffjCcOnZCdxgmYiNhmIZpz3Dc-zCkwym-K970OEa6-91vi--fV7vl13Lz7ct6udiUTtQCStMpVKA11pgvM4Ya1MZUgFDLjhqoDAgOUvSKC5JYKUJsRQ8KG-oqreVtsZ65ncejPYfhhOHZehzs9cGHvcWQBjeSdRnVuFaCy_62g9Zx0fbUCaq7XgrMrA8z6xz8zwvFZI_-EvLt0UohRf75Spusglnlgo8xUP9nq-D2JS47x2VzXPYal4VskrMpZvG0p_AX_R_XLzPalwk</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3131663756</pqid></control><display><type>article</type><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><source>Publicly Available Content Database</source><source>Full-Text Journals in Chemistry (Open access)</source><source>PubMed Central</source><source>Coronavirus Research Database</source><source>Springer Nature - nature.com Journals - Fully Open Access</source><creator>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</creator><creatorcontrib>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</creatorcontrib><description>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-024-76632-2</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>639/705/1041 ; 639/705/1042 ; 639/705/117 ; 639/705/794 ; Cyberbullying ; Deep learning ; Hate speech ; Humanities and Social Sciences ; Learning algorithms ; multidisciplinary ; Neural networks ; Science ; Science (multidisciplinary) ; Social discrimination learning ; Social networks ; Social organization ; Toxicity</subject><ispartof>Scientific reports, 2024-11, Vol.14 (1), p.28870-15, Article 28870</ispartof><rights>The Author(s) 2024</rights><rights>Copyright Nature Publishing Group 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/3131663756?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/3131663756?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,25751,27922,27923,37010,38514,43893,44588,74182,74896</link.rule.ids></links><search><creatorcontrib>Sennary, Hameda A.</creatorcontrib><creatorcontrib>Abozaid, Ghada</creatorcontrib><creatorcontrib>Hemeida, Ashraf</creatorcontrib><creatorcontrib>Mikhaylov, Alexey</creatorcontrib><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><description>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</description><subject>639/705/1041</subject><subject>639/705/1042</subject><subject>639/705/117</subject><subject>639/705/794</subject><subject>Cyberbullying</subject><subject>Deep learning</subject><subject>Hate speech</subject><subject>Humanities and Social Sciences</subject><subject>Learning algorithms</subject><subject>multidisciplinary</subject><subject>Neural networks</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Social discrimination learning</subject><subject>Social networks</subject><subject>Social organization</subject><subject>Toxicity</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>COVID</sourceid><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNp9kU-P0zAQxSMEEqtlvwAnS5wD9vhPkr1V3QKVKiFV5WxNnEmbksbFdne13x5vg4ATc5mR9d5vNH5F8V7wj4LL-lNUQjd1yUGVlTESSnhV3ABXugQJ8Pqf-W1xF-OR59LQKNHcFI_b1W67WO5WD2yx3a2Xm9U9e6BELg1-Yr5nB0x0z-KZyB1YeiJKkbUYqWPOT49-vLwIcWQTXcK1pScffjCcOnZCdxgmYiNhmIZpz3Dc-zCkwym-K970OEa6-91vi--fV7vl13Lz7ct6udiUTtQCStMpVKA11pgvM4Ya1MZUgFDLjhqoDAgOUvSKC5JYKUJsRQ8KG-oqreVtsZ65ncejPYfhhOHZehzs9cGHvcWQBjeSdRnVuFaCy_62g9Zx0fbUCaq7XgrMrA8z6xz8zwvFZI_-EvLt0UohRf75Spusglnlgo8xUP9nq-D2JS47x2VzXPYal4VskrMpZvG0p_AX_R_XLzPalwk</recordid><startdate>20241121</startdate><enddate>20241121</enddate><creator>Sennary, Hameda A.</creator><creator>Abozaid, Ghada</creator><creator>Hemeida, Ashraf</creator><creator>Mikhaylov, Alexey</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><general>Nature Portfolio</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>COVID</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>DOA</scope></search><sort><creationdate>20241121</creationdate><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><author>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>639/705/1041</topic><topic>639/705/1042</topic><topic>639/705/117</topic><topic>639/705/794</topic><topic>Cyberbullying</topic><topic>Deep learning</topic><topic>Hate speech</topic><topic>Humanities and Social Sciences</topic><topic>Learning algorithms</topic><topic>multidisciplinary</topic><topic>Neural networks</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Social discrimination learning</topic><topic>Social networks</topic><topic>Social organization</topic><topic>Toxicity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sennary, Hameda A.</creatorcontrib><creatorcontrib>Abozaid, Ghada</creatorcontrib><creatorcontrib>Hemeida, Ashraf</creatorcontrib><creatorcontrib>Mikhaylov, Alexey</creatorcontrib><collection>SpringerOpen</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Biological Sciences</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sennary, Hameda A.</au><au>Abozaid, Ghada</au><au>Hemeida, Ashraf</au><au>Mikhaylov, Alexey</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><date>2024-11-21</date><risdate>2024</risdate><volume>14</volume><issue>1</issue><spage>28870</spage><epage>15</epage><pages>28870-15</pages><artnum>28870</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><doi>10.1038/s41598-024-76632-2</doi><tpages>15</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2045-2322
ispartof	Scientific reports, 2024-11, Vol.14 (1), p.28870-15, Article 28870
issn	2045-2322 2045-2322
language	eng
recordid	cdi_doaj_primary_oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a
source	Publicly Available Content Database; Full-Text Journals in Chemistry (Open access); PubMed Central; Coronavirus Research Database; Springer Nature - nature.com Journals - Fully Open Access
subjects	639/705/1041 639/705/1042 639/705/117 639/705/794 Cyberbullying Deep learning Hate speech Humanities and Social Sciences Learning algorithms multidisciplinary Neural networks Science Science (multidisciplinary) Social discrimination learning Social networks Social organization Toxicity
title	RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T20%3A20%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RETRACTED%20ARTICLE:%20Detection%20of%20hate:%20speech%20tweets%20based%20convolutional%20neural%20network%20and%20machine%20learning%20algorithms&rft.jtitle=Scientific%20reports&rft.au=Sennary,%20Hameda%20A.&rft.date=2024-11-21&rft.volume=14&rft.issue=1&rft.spage=28870&rft.epage=15&rft.pages=28870-15&rft.artnum=28870&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-024-76632-2&rft_dat=%3Cproquest_doaj_%3E3131663756%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3131663756&rft_id=info:pmid/&rfr_iscdi=true