Modifications of the Czech Morphological Dictionary for Consistent Corpus Annotation

Bibliographic Details
Published in:Jazykovedný časopis 2019-12, Vol.70 (2), p.380-389
Main Authors: Hlaváčová, Jaroslava; Mikulová, Marie; Štěpánková, Barbora; Hajič, Jan
Format: Article
Language:English
Description
Summary:We describe systematic changes made to the Czech morphological dictionary in connection with annotating new data within the Prague Dependency Treebank (PDT) project. We present new solutions for several complicated morphological features that occur in Czech texts. We introduced two new parts of speech, namely foreign word and segment. We adopted new principles for the morphological analysis of global and inflectional variants, homonymous lemmas, abbreviations and aggregates. The changes were motivated by the need for consistency between the data and the dictionary, and within the dictionary itself.
ISSN:0021-5597, 1338-4287
DOI:10.2478/jazcas-2019-0067