Loading…

A longitudinal study of the top 1% toxic Twitter profiles

Toxicity is endemic to online social networks including Twitter. It follows a Pareto like distribution where most of the toxicity is generated by a very small number of profiles and as such, analyzing and characterizing these toxic profiles is critical. Prior research has largely focused on sporadic...

Full description

Saved in:

Bibliographic Details
Published in:	arXiv.org 2023-03
Main Authors:	Qayyum, Hina, Benjamin Zi Hao Zhao, Wood, Ian D, Ikram, Muhammad, Mohamed Ali Kaafar, Kourtellis, Nicolas
Format:	Article
Language:	English
Subjects:	Longitudinal studies Social networks Tags Toxicity
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

cited_by
cites
container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Qayyum, Hina Benjamin Zi Hao Zhao Wood, Ian D Ikram, Muhammad Mohamed Ali Kaafar Kourtellis, Nicolas
description	Toxicity is endemic to online social networks including Twitter. It follows a Pareto like distribution where most of the toxicity is generated by a very small number of profiles and as such, analyzing and characterizing these toxic profiles is critical. Prior research has largely focused on sporadic, event centric toxic content to characterize toxicity on the platform. Instead, we approach the problem of characterizing toxic content from a profile centric point of view. We study 143K Twitter profiles and focus on the behavior of the top 1 percent producers of toxic content on Twitter, based on toxicity scores of their tweets availed by Perspective API. With a total of 293M tweets, spanning 16 years of activity, the longitudinal data allow us to reconstruct the timelines of all profiles involved. We use these timelines to gauge the behavior of the most toxic Twitter profiles compared to the rest of the Twitter population. We study the pattern of tweet posting from highly toxic accounts, based on the frequency and how prolific they are, the nature of hashtags and URLs, profile metadata, and Botometer scores. We find that the highly toxic profiles post coherent and well articulated content, their tweets keep to a narrow theme with lower diversity in hashtags, URLs, and domains, they are thematically similar to each other, and have a high likelihood of bot like behavior, likely to have progenitors with intentions to influence, based on high fake followers score. Our work contributes insight into the top 1 percent of toxic profiles on Twitter and establishes the profile centric approach to investigate toxicity on Twitter to be beneficial.
doi_str_mv	10.48550/arxiv.2303.14603
format	article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2791774611</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2791774611</sourcerecordid><originalsourceid>FETCH-LOGICAL-a521-2d44ab572ad3ddc7a5796df3a1314f599111ae4d49b0e71ca470c269124d844e3</originalsourceid><addsrcrecordid>eNotjk1LwzAch4MgOOY-gLeAeGzN_yVNcxzDl8HAS-8ja1LNKOtsUp3f3oKent_p-T1C3IEqudZaPbrxEr9KJEUlcKXoSiyQCIqaEW_EKqWjUgorg1rTQti17IfTe8yTjyfXyzSPHzl0Mn8EmYezhIcZl9jK5jvmHEZ5Hocu9iHdiuvO9Sms_rkUzfNTs3ktdm8v2816VziNUKBndgdt0HnyvjVOG1v5jhwQcKetBQAX2LM9qGCgdWxUi5UFZF8zB1qK-z_t_Ps5hZT3x2Ea59S0R2PBGK4A6BeRxUcu</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2791774611</pqid></control><display><type>article</type><title>A longitudinal study of the top 1% toxic Twitter profiles</title><source>Publicly Available Content Database</source><creator>Qayyum, Hina ; Benjamin Zi Hao Zhao ; Wood, Ian D ; Ikram, Muhammad ; Mohamed Ali Kaafar ; Kourtellis, Nicolas</creator><creatorcontrib>Qayyum, Hina ; Benjamin Zi Hao Zhao ; Wood, Ian D ; Ikram, Muhammad ; Mohamed Ali Kaafar ; Kourtellis, Nicolas</creatorcontrib><description>Toxicity is endemic to online social networks including Twitter. It follows a Pareto like distribution where most of the toxicity is generated by a very small number of profiles and as such, analyzing and characterizing these toxic profiles is critical. Prior research has largely focused on sporadic, event centric toxic content to characterize toxicity on the platform. Instead, we approach the problem of characterizing toxic content from a profile centric point of view. We study 143K Twitter profiles and focus on the behavior of the top 1 percent producers of toxic content on Twitter, based on toxicity scores of their tweets availed by Perspective API. With a total of 293M tweets, spanning 16 years of activity, the longitudinal data allow us to reconstruct the timelines of all profiles involved. We use these timelines to gauge the behavior of the most toxic Twitter profiles compared to the rest of the Twitter population. We study the pattern of tweet posting from highly toxic accounts, based on the frequency and how prolific they are, the nature of hashtags and URLs, profile metadata, and Botometer scores. We find that the highly toxic profiles post coherent and well articulated content, their tweets keep to a narrow theme with lower diversity in hashtags, URLs, and domains, they are thematically similar to each other, and have a high likelihood of bot like behavior, likely to have progenitors with intentions to influence, based on high fake followers score. Our work contributes insight into the top 1 percent of toxic profiles on Twitter and establishes the profile centric approach to investigate toxicity on Twitter to be beneficial.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2303.14603</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Longitudinal studies ; Social networks ; Tags ; Toxicity</subject><ispartof>arXiv.org, 2023-03</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2791774611?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,27925,37012,44590</link.rule.ids></links><search><creatorcontrib>Qayyum, Hina</creatorcontrib><creatorcontrib>Benjamin Zi Hao Zhao</creatorcontrib><creatorcontrib>Wood, Ian D</creatorcontrib><creatorcontrib>Ikram, Muhammad</creatorcontrib><creatorcontrib>Mohamed Ali Kaafar</creatorcontrib><creatorcontrib>Kourtellis, Nicolas</creatorcontrib><title>A longitudinal study of the top 1% toxic Twitter profiles</title><title>arXiv.org</title><description>Toxicity is endemic to online social networks including Twitter. It follows a Pareto like distribution where most of the toxicity is generated by a very small number of profiles and as such, analyzing and characterizing these toxic profiles is critical. Prior research has largely focused on sporadic, event centric toxic content to characterize toxicity on the platform. Instead, we approach the problem of characterizing toxic content from a profile centric point of view. We study 143K Twitter profiles and focus on the behavior of the top 1 percent producers of toxic content on Twitter, based on toxicity scores of their tweets availed by Perspective API. With a total of 293M tweets, spanning 16 years of activity, the longitudinal data allow us to reconstruct the timelines of all profiles involved. We use these timelines to gauge the behavior of the most toxic Twitter profiles compared to the rest of the Twitter population. We study the pattern of tweet posting from highly toxic accounts, based on the frequency and how prolific they are, the nature of hashtags and URLs, profile metadata, and Botometer scores. We find that the highly toxic profiles post coherent and well articulated content, their tweets keep to a narrow theme with lower diversity in hashtags, URLs, and domains, they are thematically similar to each other, and have a high likelihood of bot like behavior, likely to have progenitors with intentions to influence, based on high fake followers score. Our work contributes insight into the top 1 percent of toxic profiles on Twitter and establishes the profile centric approach to investigate toxicity on Twitter to be beneficial.</description><subject>Longitudinal studies</subject><subject>Social networks</subject><subject>Tags</subject><subject>Toxicity</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNotjk1LwzAch4MgOOY-gLeAeGzN_yVNcxzDl8HAS-8ja1LNKOtsUp3f3oKent_p-T1C3IEqudZaPbrxEr9KJEUlcKXoSiyQCIqaEW_EKqWjUgorg1rTQti17IfTe8yTjyfXyzSPHzl0Mn8EmYezhIcZl9jK5jvmHEZ5Hocu9iHdiuvO9Sms_rkUzfNTs3ktdm8v2816VziNUKBndgdt0HnyvjVOG1v5jhwQcKetBQAX2LM9qGCgdWxUi5UFZF8zB1qK-z_t_Ps5hZT3x2Ea59S0R2PBGK4A6BeRxUcu</recordid><startdate>20230326</startdate><enddate>20230326</enddate><creator>Qayyum, Hina</creator><creator>Benjamin Zi Hao Zhao</creator><creator>Wood, Ian D</creator><creator>Ikram, Muhammad</creator><creator>Mohamed Ali Kaafar</creator><creator>Kourtellis, Nicolas</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20230326</creationdate><title>A longitudinal study of the top 1% toxic Twitter profiles</title><author>Qayyum, Hina ; Benjamin Zi Hao Zhao ; Wood, Ian D ; Ikram, Muhammad ; Mohamed Ali Kaafar ; Kourtellis, Nicolas</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a521-2d44ab572ad3ddc7a5796df3a1314f599111ae4d49b0e71ca470c269124d844e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Longitudinal studies</topic><topic>Social networks</topic><topic>Tags</topic><topic>Toxicity</topic><toplevel>online_resources</toplevel><creatorcontrib>Qayyum, Hina</creatorcontrib><creatorcontrib>Benjamin Zi Hao Zhao</creatorcontrib><creatorcontrib>Wood, Ian D</creatorcontrib><creatorcontrib>Ikram, Muhammad</creatorcontrib><creatorcontrib>Mohamed Ali Kaafar</creatorcontrib><creatorcontrib>Kourtellis, Nicolas</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>AUTh Library subscriptions: ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qayyum, Hina</au><au>Benjamin Zi Hao Zhao</au><au>Wood, Ian D</au><au>Ikram, Muhammad</au><au>Mohamed Ali Kaafar</au><au>Kourtellis, Nicolas</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A longitudinal study of the top 1% toxic Twitter profiles</atitle><jtitle>arXiv.org</jtitle><date>2023-03-26</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Toxicity is endemic to online social networks including Twitter. It follows a Pareto like distribution where most of the toxicity is generated by a very small number of profiles and as such, analyzing and characterizing these toxic profiles is critical. Prior research has largely focused on sporadic, event centric toxic content to characterize toxicity on the platform. Instead, we approach the problem of characterizing toxic content from a profile centric point of view. We study 143K Twitter profiles and focus on the behavior of the top 1 percent producers of toxic content on Twitter, based on toxicity scores of their tweets availed by Perspective API. With a total of 293M tweets, spanning 16 years of activity, the longitudinal data allow us to reconstruct the timelines of all profiles involved. We use these timelines to gauge the behavior of the most toxic Twitter profiles compared to the rest of the Twitter population. We study the pattern of tweet posting from highly toxic accounts, based on the frequency and how prolific they are, the nature of hashtags and URLs, profile metadata, and Botometer scores. We find that the highly toxic profiles post coherent and well articulated content, their tweets keep to a narrow theme with lower diversity in hashtags, URLs, and domains, they are thematically similar to each other, and have a high likelihood of bot like behavior, likely to have progenitors with intentions to influence, based on high fake followers score. Our work contributes insight into the top 1 percent of toxic profiles on Twitter and establishes the profile centric approach to investigate toxicity on Twitter to be beneficial.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2303.14603</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2023-03
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2791774611
source	Publicly Available Content Database
subjects	Longitudinal studies Social networks Tags Toxicity
title	A longitudinal study of the top 1% toxic Twitter profiles
url	http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-24T17%3A35%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20longitudinal%20study%20of%20the%20top%201%25%20toxic%20Twitter%20profiles&rft.jtitle=arXiv.org&rft.au=Qayyum,%20Hina&rft.date=2023-03-26&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2303.14603&rft_dat=%3Cproquest%3E2791774611%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-a521-2d44ab572ad3ddc7a5796df3a1314f599111ae4d49b0e71ca470c269124d844e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2791774611&rft_id=info:pmid/&rfr_iscdi=true