Loading…

Functions and translations of discourse markers in TED Talks: A parallel corpus study of underspecification in five languages

Discourse markers are highly polyfunctional, particularly in spoken settings. Because of their syntactic optionality, they are often omitted in translations, especially in the restricted space of subtitles such as the parallel transcripts of TED Talks. In this study, we combine discourse annotation...

Full description

Saved in:
Bibliographic Details
Published in:Journal of pragmatics 2019-03, Vol.142, p.139-155
Main Authors: Crible, Ludivine, Abuczki, Ágnes, Burkšaitienė, Nijolė, Furkó, Péter, Nedoluzhko, Anna, Rackevičienė, Sigita, Oleškevičienė, Giedrė Valūnaitė, Zikánová, Šárka
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Discourse markers are highly polyfunctional, particularly in spoken settings. Because of their syntactic optionality, they are often omitted in translations, especially in the restricted space of subtitles such as the parallel transcripts of TED Talks. In this study, we combine discourse annotation and translation spotting to investigate English discourse markers, focusing on their functions, omission and translation equivalents in Czech, French, Hungarian and Lithuanian. In particular, we study them through the lens of underspecification, of which we distinguish one monolingual and two multilingual types. After making an inventory of all discourse markers in the dataset, we zoom in on the three most frequent and, but and so. Our small-scale yet fine-grained corpus study suggests that the processes of underspecification are based on the semantics of discourse markers and are therefore shared cross-linguistically. However, not all discourse marker types nor their functions are equally affected by underspecification. Moreover, monolingual and multilingual underspecification do not always map for a particular marker. Beyond the empirical analysis of three highly frequent discourse markers in a sample of TED Talks, this study illustrates how translation and annotation can be combined to explore the multiple facets of underspecification in a monolingual and multilingual perspective. •Annotation of translation corpora reveals three types of underspecification that affect discourse markers.•Omissions and the variety of translation equivalents vary with the function of the original marker across languages.•And, but, so and their translations express a limited number of functions and domains, comparable across languages.
ISSN:0378-2166
1879-1387
DOI:10.1016/j.pragma.2019.01.012