Loading…

Multilingual Power and Ideology Identification in the Parliament: a Reference Dataset and Simple Baselines

We introduce a dataset on political orientation and power position identification. The dataset is derived from ParlaMint, a set of comparable corpora of transcribed parliamentary speeches from 29 national and regional parliaments. We introduce the dataset, provide the reasoning behind some of the ch...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2024-05
Main Authors: Çöltekin, Çağrı, Kopp, Matyáš, Meden, Katja, Morkevicius, Vaidas, Ljubešić, Nikola, Erjavec, Tomaž
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We introduce a dataset on political orientation and power position identification. The dataset is derived from ParlaMint, a set of comparable corpora of transcribed parliamentary speeches from 29 national and regional parliaments. We introduce the dataset, provide the reasoning behind some of the choices during its creation, present statistics on the dataset, and, using a simple classifier, some baseline results on predicting political orientation on the left-to-right axis, and on power position identification, i.e., distinguishing between the speeches delivered by governing coalition party members from those of opposition party members.
ISSN:2331-8422