Loading…

Konzistence morfologického slovníku MorfFlex

Language corpora usually contain, in addition to their own texts, various types of annotations. The most common one is a morphological annotation, which consists in assigning a lemma and a morphological tag to each wordform. For morphological tagging, morphological dictionaries are traditionally use...

Full description

Saved in:
Bibliographic Details
Published in:Jazykovedný časopis 2021-01, Vol.72 (4), p.855-861
Main Authors: Hlaváčová, Jaroslava, Mikulová, Marie, Štěpánková, Barbora
Format: Article
Language:cze
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Language corpora usually contain, in addition to their own texts, various types of annotations. The most common one is a morphological annotation, which consists in assigning a lemma and a morphological tag to each wordform. For morphological tagging, morphological dictionaries are traditionally used. Our paper presents a new version of the so-called "Prague" morphological dictionary MorfFlex used for tagging many Czech corpora (particularly Prague Dependency Treebanks, corpora published by the Institute of the Czech National Corpus in Prague or large Czech web corpora of the Aranea series). Three basic principles were used to update the dictionary: the Golden Rule of Morphology, the Principle of Paradigm Unity, and the Principle of Paradigm Uniqueness.
ISSN:0021-5597
1338-4287
DOI:10.2478/jazcas-2022-0010