A longitudinal study of the top 1% toxic Twitter profiles
Toxicity is endemic to online social networks, including Twitter. It follows a Pareto-like distribution in which a very small number of profiles generate most of the toxicity, so analyzing and characterizing these toxic profiles is critical. Prior research has largely focused on sporadic...
Published in: | arXiv.org 2023-03 |
---|---|
Main Authors: | Hina Qayyum, Benjamin Zi Hao Zhao, Ian D Wood, Muhammad Ikram, Mohamed Ali Kaafar, Nicolas Kourtellis |
Format: | Article |
Language: | English |
Subjects: | Longitudinal studies, Social networks, Tags, Toxicity |
Online Access: | Get full text |
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Qayyum, Hina ; Benjamin Zi Hao Zhao ; Wood, Ian D ; Ikram, Muhammad ; Mohamed Ali Kaafar ; Kourtellis, Nicolas |
description | Toxicity is endemic to online social networks, including Twitter. It follows a Pareto-like distribution in which a very small number of profiles generate most of the toxicity, so analyzing and characterizing these toxic profiles is critical. Prior research has largely focused on sporadic, event-centric toxic content to characterize toxicity on the platform. Instead, we approach the problem of characterizing toxic content from a profile-centric point of view. We study 143K Twitter profiles and focus on the behavior of the top 1 percent of producers of toxic content on Twitter, based on the toxicity scores of their tweets provided by the Perspective API. With a total of 293M tweets spanning 16 years of activity, the longitudinal data allow us to reconstruct the timelines of all profiles involved. We use these timelines to gauge the behavior of the most toxic Twitter profiles compared to the rest of the Twitter population. We study the tweet-posting patterns of highly toxic accounts, based on how frequently and prolifically they post, the nature of their hashtags and URLs, profile metadata, and Botometer scores. We find that the highly toxic profiles post coherent and well-articulated content; their tweets keep to a narrow theme, with lower diversity in hashtags, URLs, and domains; they are thematically similar to each other; and they have a high likelihood of bot-like behavior and, based on high fake-follower scores, likely have progenitors with intentions to influence. Our work contributes insight into the top 1 percent of toxic profiles on Twitter and establishes that a profile-centric approach to investigating toxicity on Twitter is beneficial. |
doi_str_mv | 10.48550/arxiv.2303.14603 |
format | article |
date | 2023-03-26 |
publisher | Ithaca: Cornell University Library, arXiv.org |
rights | 2023. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2023-03 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2791774611 |
source | Publicly Available Content Database |
subjects | Longitudinal studies ; Social networks ; Tags ; Toxicity |
title | A longitudinal study of the top 1% toxic Twitter profiles |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T17%3A35%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20longitudinal%20study%20of%20the%20top%201%25%20toxic%20Twitter%20profiles&rft.jtitle=arXiv.org&rft.au=Qayyum,%20Hina&rft.date=2023-03-26&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2303.14603&rft_dat=%3Cproquest%3E2791774611%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a521-2d44ab572ad3ddc7a5796df3a1314f599111ae4d49b0e71ca470c269124d844e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2791774611&rft_id=info:pmid/&rfr_iscdi=true |