Loading…
RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms
There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always st...
Saved in:
Published in: | Scientific reports 2024-11, Vol.14 (1), p.28870-15, Article 28870 |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | cdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553 |
container_end_page | 15 |
container_issue | 1 |
container_start_page | 28870 |
container_title | Scientific reports |
container_volume | 14 |
creator | Sennary, Hameda A. Abozaid, Ghada Hemeida, Ashraf Mikhaylov, Alexey |
description | There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%. |
doi_str_mv | 10.1038/s41598-024-76632-2 |
format | article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a</doaj_id><sourcerecordid>3131663756</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</originalsourceid><addsrcrecordid>eNp9kU-P0zAQxSMEEqtlvwAnS5wD9vhPkr1V3QKVKiFV5WxNnEmbksbFdne13x5vg4ATc5mR9d5vNH5F8V7wj4LL-lNUQjd1yUGVlTESSnhV3ABXugQJ8Pqf-W1xF-OR59LQKNHcFI_b1W67WO5WD2yx3a2Xm9U9e6BELg1-Yr5nB0x0z-KZyB1YeiJKkbUYqWPOT49-vLwIcWQTXcK1pScffjCcOnZCdxgmYiNhmIZpz3Dc-zCkwym-K970OEa6-91vi--fV7vl13Lz7ct6udiUTtQCStMpVKA11pgvM4Ya1MZUgFDLjhqoDAgOUvSKC5JYKUJsRQ8KG-oqreVtsZ65ncejPYfhhOHZehzs9cGHvcWQBjeSdRnVuFaCy_62g9Zx0fbUCaq7XgrMrA8z6xz8zwvFZI_-EvLt0UohRf75Spusglnlgo8xUP9nq-D2JS47x2VzXPYal4VskrMpZvG0p_AX_R_XLzPalwk</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3131663756</pqid></control><display><type>article</type><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><source>Publicly Available Content Database</source><source>Full-Text Journals in Chemistry (Open access)</source><source>PubMed Central</source><source>Coronavirus Research Database</source><source>Springer Nature - nature.com Journals - Fully Open Access</source><creator>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</creator><creatorcontrib>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</creatorcontrib><description>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-024-76632-2</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>639/705/1041 ; 639/705/1042 ; 639/705/117 ; 639/705/794 ; Cyberbullying ; Deep learning ; Hate speech ; Humanities and Social Sciences ; Learning algorithms ; multidisciplinary ; Neural networks ; Science ; Science (multidisciplinary) ; Social discrimination learning ; Social networks ; Social organization ; Toxicity</subject><ispartof>Scientific reports, 2024-11, Vol.14 (1), p.28870-15, Article 28870</ispartof><rights>The Author(s) 2024</rights><rights>Copyright Nature Publishing Group 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.proquest.com/docview/3131663756?pq-origsite=primo$$EPDF$$P50$$Gproquest$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.proquest.com/docview/3131663756?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,25751,27922,27923,37010,38514,43893,44588,74182,74896</link.rule.ids></links><search><creatorcontrib>Sennary, Hameda A.</creatorcontrib><creatorcontrib>Abozaid, Ghada</creatorcontrib><creatorcontrib>Hemeida, Ashraf</creatorcontrib><creatorcontrib>Mikhaylov, Alexey</creatorcontrib><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><description>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</description><subject>639/705/1041</subject><subject>639/705/1042</subject><subject>639/705/117</subject><subject>639/705/794</subject><subject>Cyberbullying</subject><subject>Deep learning</subject><subject>Hate speech</subject><subject>Humanities and Social Sciences</subject><subject>Learning algorithms</subject><subject>multidisciplinary</subject><subject>Neural networks</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Social discrimination learning</subject><subject>Social networks</subject><subject>Social organization</subject><subject>Toxicity</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>COVID</sourceid><sourceid>PIMPY</sourceid><sourceid>DOA</sourceid><recordid>eNp9kU-P0zAQxSMEEqtlvwAnS5wD9vhPkr1V3QKVKiFV5WxNnEmbksbFdne13x5vg4ATc5mR9d5vNH5F8V7wj4LL-lNUQjd1yUGVlTESSnhV3ABXugQJ8Pqf-W1xF-OR59LQKNHcFI_b1W67WO5WD2yx3a2Xm9U9e6BELg1-Yr5nB0x0z-KZyB1YeiJKkbUYqWPOT49-vLwIcWQTXcK1pScffjCcOnZCdxgmYiNhmIZpz3Dc-zCkwym-K970OEa6-91vi--fV7vl13Lz7ct6udiUTtQCStMpVKA11pgvM4Ya1MZUgFDLjhqoDAgOUvSKC5JYKUJsRQ8KG-oqreVtsZ65ncejPYfhhOHZehzs9cGHvcWQBjeSdRnVuFaCy_62g9Zx0fbUCaq7XgrMrA8z6xz8zwvFZI_-EvLt0UohRf75Spusglnlgo8xUP9nq-D2JS47x2VzXPYal4VskrMpZvG0p_AX_R_XLzPalwk</recordid><startdate>20241121</startdate><enddate>20241121</enddate><creator>Sennary, Hameda A.</creator><creator>Abozaid, Ghada</creator><creator>Hemeida, Ashraf</creator><creator>Mikhaylov, Alexey</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><general>Nature Portfolio</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>COVID</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>DOA</scope></search><sort><creationdate>20241121</creationdate><title>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</title><author>Sennary, Hameda A. ; Abozaid, Ghada ; Hemeida, Ashraf ; Mikhaylov, Alexey</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>639/705/1041</topic><topic>639/705/1042</topic><topic>639/705/117</topic><topic>639/705/794</topic><topic>Cyberbullying</topic><topic>Deep learning</topic><topic>Hate speech</topic><topic>Humanities and Social Sciences</topic><topic>Learning algorithms</topic><topic>multidisciplinary</topic><topic>Neural networks</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Social discrimination learning</topic><topic>Social networks</topic><topic>Social organization</topic><topic>Toxicity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sennary, Hameda A.</creatorcontrib><creatorcontrib>Abozaid, Ghada</creatorcontrib><creatorcontrib>Hemeida, Ashraf</creatorcontrib><creatorcontrib>Mikhaylov, Alexey</creatorcontrib><collection>SpringerOpen</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Biological Sciences</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sennary, Hameda A.</au><au>Abozaid, Ghada</au><au>Hemeida, Ashraf</au><au>Mikhaylov, Alexey</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><date>2024-11-21</date><risdate>2024</risdate><volume>14</volume><issue>1</issue><spage>28870</spage><epage>15</epage><pages>28870-15</pages><artnum>28870</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these advantages, there are disadvantages that we always strive to find a solution. One of these disadvantages is sharing hate speech. In our study, we’re discussing a way to solve this phenomenon by using Term Frequency-Inverse Document Frequency (TF-IDF) based approach to feature engineering on eleven classifiers for machine and deep learning that can automatically identify hate speech. Three different databases were used, the first of which “Hate speech offensive tweets by Davidson et al.”, the second called "Twitter hate speech" and finally we merged the second data with (Cyberbullying dataset (toxicity_parsed_dataset)". The classifiers involved are Logistic Regression (LR), Naive Bayes (NB), Multi-layer Perceptron (MLP), and Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), K-Means, Decision Tree (DT), Gradient Boosting classifier (GBC), and the Extra Trees (ET) in addition to the convolutional neural network (CNN). Maximum accuracy was attained, which exceeded 99%.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><doi>10.1038/s41598-024-76632-2</doi><tpages>15</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2045-2322 |
ispartof | Scientific reports, 2024-11, Vol.14 (1), p.28870-15, Article 28870 |
issn | 2045-2322 2045-2322 |
language | eng |
recordid | cdi_doaj_primary_oai_doaj_org_article_c2109cb32cf24bd2bc01bfed1e8df31a |
source | Publicly Available Content Database; Full-Text Journals in Chemistry (Open access); PubMed Central; Coronavirus Research Database; Springer Nature - nature.com Journals - Fully Open Access |
subjects | 639/705/1041 639/705/1042 639/705/117 639/705/794 Cyberbullying Deep learning Hate speech Humanities and Social Sciences Learning algorithms multidisciplinary Neural networks Science Science (multidisciplinary) Social discrimination learning Social networks Social organization Toxicity |
title | RETRACTED ARTICLE: Detection of hate: speech tweets based convolutional neural network and machine learning algorithms |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T20%3A20%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RETRACTED%20ARTICLE:%20Detection%20of%20hate:%20speech%20tweets%20based%20convolutional%20neural%20network%20and%20machine%20learning%20algorithms&rft.jtitle=Scientific%20reports&rft.au=Sennary,%20Hameda%20A.&rft.date=2024-11-21&rft.volume=14&rft.issue=1&rft.spage=28870&rft.epage=15&rft.pages=28870-15&rft.artnum=28870&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-024-76632-2&rft_dat=%3Cproquest_doaj_%3E3131663756%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c1812-6d4a4255a8a76666e9a56672a283de9276210231f401e3a74eaab1f24a9ed7553%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=3131663756&rft_id=info:pmid/&rfr_iscdi=true |