Loading…

Heterogeneous relation graphs as a formalism for representing linguistic information

Heterogeneous relation graphs (HRGs) can be used to represent arbitrary linguistic information. Originally designed for use in speech synthesis, HRGs can be used for speech annotation purposes also. In the HRG formalism, atomic linguistic entities such as words, syllables and phones are represented...

Full description

Saved in:
Bibliographic Details
Published in:Speech communication 2001, Vol.33 (1), p.153-174
Main Authors: Taylor, Paul, Black, Alan W, Caley, Richard
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Heterogeneous relation graphs (HRGs) can be used to represent arbitrary linguistic information. Originally designed for use in speech synthesis, HRGs can be used for speech annotation purposes also. In the HRG formalism, atomic linguistic entities such as words, syllables and phones are represented by attribute value matrices known as linguistic items. Using attribute value matrices for items allows them to contain any type or amount of linguistic information. Items are organised into linguistic relations, which take the form of lists, trees or other structures. Items can belong to more than one relation, which allows words to appear in the word relation and, say, the syntax relation. The ability to have items in multiple relations, along with a function ability which can calculate certain values on the fly, eliminates much of the redundancy present in simpler systems. The HRG formalism is not tied to any particular linguistic theory, nor does it impose any preset ideas about what sort of format syntax, prosody or phonology information should have. This paper explains the HRG formalism in detail, and shows why we think this is superior to the types of “multi-level” formats normally used in speech synthesis and database annotation.
ISSN:0167-6393
1872-7182
DOI:10.1016/S0167-6393(00)00074-1