Loading…

Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale

In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimodal essence of these debates. To address this gap, we introduce a comprehensive...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2024-08
Main Authors: Agarwal, Anmol, Priyadarshi, Pratyush, Sinha, Shiven, Gupta, Shrey, Jangra, Hitkul, Kumaraguru, Ponnurangam, Garimella, Kiran
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Agarwal, Anmol
Priyadarshi, Pratyush
Sinha, Shiven
Gupta, Shrey
Jangra, Hitkul
Kumaraguru, Ponnurangam
Garimella, Kiran
description In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimodal essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scale multimedia analysis. Utilizing state-of-the-art computer vision algorithms and speech-to-text methods, we transcribe, diarize, and analyze thousands of YouTube videos of a prime-time television debate show in India. These debates are a central part of Indian media but have been criticized for compromised journalistic integrity and excessive dramatization. Our toolkit provides concrete metrics to assess bias and incivility, capturing a comprehensive multimedia perspective that includes text, audio utterances, and video frames. Our findings reveal significant biases in topic selection and panelist representation, along with alarming levels of incivility. This work offers a scalable, automated approach for future research in multimedia analysis, with profound implications for the quality of public discourse and democratic debate. To catalyze further research in this area, we also release the code, dataset collected and supplemental pdf.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2929294728</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2929294728</sourcerecordid><originalsourceid>FETCH-proquest_journals_29292947283</originalsourceid><addsrcrecordid>eNqNjM0KgkAURocgSMp3uNBasBn_ahda1KJV7mUYbzQyOuYdhd4-gx4gvsXZnPMtmMeF2AVZxPmK-URNGIY8SXkcC49dSzQ4adK2g0KTsuNACAUqW2N9gNy2_YBP7EhPCLfRON3aWho4dtK8nVYE0sFdSYMbtnxIQ-j_uGbb86nML0E_2NeI5KpmPp8zqvj-uyjlmfjP-gCm3Tyk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2929294728</pqid></control><display><type>article</type><title>Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale</title><source>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</source><creator>Agarwal, Anmol ; Priyadarshi, Pratyush ; Sinha, Shiven ; Gupta, Shrey ; Jangra, Hitkul ; Kumaraguru, Ponnurangam ; Garimella, Kiran</creator><creatorcontrib>Agarwal, Anmol ; Priyadarshi, Pratyush ; Sinha, Shiven ; Gupta, Shrey ; Jangra, Hitkul ; Kumaraguru, Ponnurangam ; Garimella, Kiran</creatorcontrib><description>In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimodal essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scale multimedia analysis. Utilizing state-of-the-art computer vision algorithms and speech-to-text methods, we transcribe, diarize, and analyze thousands of YouTube videos of a prime-time television debate show in India. These debates are a central part of Indian media but have been criticized for compromised journalistic integrity and excessive dramatization. Our toolkit provides concrete metrics to assess bias and incivility, capturing a comprehensive multimedia perspective that includes text, audio utterances, and video frames. Our findings reveal significant biases in topic selection and panelist representation, along with alarming levels of incivility. This work offers a scalable, automated approach for future research in multimedia analysis, with profound implications for the quality of public discourse and democratic debate. To catalyze further research in this area, we also release the code, dataset collected and supplemental pdf.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Automation ; Bias ; Computer vision ; Data analysis ; Data collection ; Debates ; Multimedia ; Speech recognition ; Television ; Toolkits</subject><ispartof>arXiv.org, 2024-08</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2929294728?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>780,784,25753,37012,44590</link.rule.ids></links><search><creatorcontrib>Agarwal, Anmol</creatorcontrib><creatorcontrib>Priyadarshi, Pratyush</creatorcontrib><creatorcontrib>Sinha, Shiven</creatorcontrib><creatorcontrib>Gupta, Shrey</creatorcontrib><creatorcontrib>Jangra, Hitkul</creatorcontrib><creatorcontrib>Kumaraguru, Ponnurangam</creatorcontrib><creatorcontrib>Garimella, Kiran</creatorcontrib><title>Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale</title><title>arXiv.org</title><description>In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimodal essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scale multimedia analysis. Utilizing state-of-the-art computer vision algorithms and speech-to-text methods, we transcribe, diarize, and analyze thousands of YouTube videos of a prime-time television debate show in India. These debates are a central part of Indian media but have been criticized for compromised journalistic integrity and excessive dramatization. Our toolkit provides concrete metrics to assess bias and incivility, capturing a comprehensive multimedia perspective that includes text, audio utterances, and video frames. Our findings reveal significant biases in topic selection and panelist representation, along with alarming levels of incivility. This work offers a scalable, automated approach for future research in multimedia analysis, with profound implications for the quality of public discourse and democratic debate. To catalyze further research in this area, we also release the code, dataset collected and supplemental pdf.</description><subject>Algorithms</subject><subject>Automation</subject><subject>Bias</subject><subject>Computer vision</subject><subject>Data analysis</subject><subject>Data collection</subject><subject>Debates</subject><subject>Multimedia</subject><subject>Speech recognition</subject><subject>Television</subject><subject>Toolkits</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNjM0KgkAURocgSMp3uNBasBn_ahda1KJV7mUYbzQyOuYdhd4-gx4gvsXZnPMtmMeF2AVZxPmK-URNGIY8SXkcC49dSzQ4adK2g0KTsuNACAUqW2N9gNy2_YBP7EhPCLfRON3aWho4dtK8nVYE0sFdSYMbtnxIQ-j_uGbb86nML0E_2NeI5KpmPp8zqvj-uyjlmfjP-gCm3Tyk</recordid><startdate>20240806</startdate><enddate>20240806</enddate><creator>Agarwal, Anmol</creator><creator>Priyadarshi, Pratyush</creator><creator>Sinha, Shiven</creator><creator>Gupta, Shrey</creator><creator>Jangra, Hitkul</creator><creator>Kumaraguru, Ponnurangam</creator><creator>Garimella, Kiran</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240806</creationdate><title>Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale</title><author>Agarwal, Anmol ; Priyadarshi, Pratyush ; Sinha, Shiven ; Gupta, Shrey ; Jangra, Hitkul ; Kumaraguru, Ponnurangam ; Garimella, Kiran</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_29292947283</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Automation</topic><topic>Bias</topic><topic>Computer vision</topic><topic>Data analysis</topic><topic>Data collection</topic><topic>Debates</topic><topic>Multimedia</topic><topic>Speech recognition</topic><topic>Television</topic><topic>Toolkits</topic><toplevel>online_resources</toplevel><creatorcontrib>Agarwal, Anmol</creatorcontrib><creatorcontrib>Priyadarshi, Pratyush</creatorcontrib><creatorcontrib>Sinha, Shiven</creatorcontrib><creatorcontrib>Gupta, Shrey</creatorcontrib><creatorcontrib>Jangra, Hitkul</creatorcontrib><creatorcontrib>Kumaraguru, Ponnurangam</creatorcontrib><creatorcontrib>Garimella, Kiran</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Agarwal, Anmol</au><au>Priyadarshi, Pratyush</au><au>Sinha, Shiven</au><au>Gupta, Shrey</au><au>Jangra, Hitkul</au><au>Kumaraguru, Ponnurangam</au><au>Garimella, Kiran</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale</atitle><jtitle>arXiv.org</jtitle><date>2024-08-06</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimodal essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scale multimedia analysis. Utilizing state-of-the-art computer vision algorithms and speech-to-text methods, we transcribe, diarize, and analyze thousands of YouTube videos of a prime-time television debate show in India. These debates are a central part of Indian media but have been criticized for compromised journalistic integrity and excessive dramatization. Our toolkit provides concrete metrics to assess bias and incivility, capturing a comprehensive multimedia perspective that includes text, audio utterances, and video frames. Our findings reveal significant biases in topic selection and panelist representation, along with alarming levels of incivility. This work offers a scalable, automated approach for future research in multimedia analysis, with profound implications for the quality of public discourse and democratic debate. To catalyze further research in this area, we also release the code, dataset collected and supplemental pdf.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-08
issn 2331-8422
language eng
recordid cdi_proquest_journals_2929294728
source Publicly Available Content Database (Proquest) (PQ_SDU_P3)
subjects Algorithms
Automation
Bias
Computer vision
Data analysis
Data collection
Debates
Multimedia
Speech recognition
Television
Toolkits
title Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T16%3A42%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Television%20Discourse%20Decoded:%20Comprehensive%20Multimodal%20Analytics%20at%20Scale&rft.jtitle=arXiv.org&rft.au=Agarwal,%20Anmol&rft.date=2024-08-06&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2929294728%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_29292947283%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2929294728&rft_id=info:pmid/&rfr_iscdi=true