
Computational Models of Learning the Raising-Control Distinction

Bibliographic Details
Published in:	Research on language and computation 2010-09, Vol.8 (2-3), p.169-207
Main Authors:	Mitchener, William Garrett; Becker, Misha
Format:	Article
Language:	English
Subjects:	Computational Linguistics; Computer Science; Linguistics; Philosophy of Language; Social Sciences; Symbolic and Algebraic Manipulation

Description
Summary:	We consider the task of learning three verb classes: raising (e.g., seem), control (e.g., try) and ambiguous verbs that can be used either way (e.g., begin). These verbs occur in sentences with similar surface forms, but have distinct syntactic and semantic properties. They present a conundrum because it would seem that their meaning must be known to infer their syntax, and that their syntax must be known to infer their meaning. Previous research with human speakers pointed to the usefulness of two cues found in sentences containing these verbs: animacy of the sentence subject and eventivity of the predicate embedded under the main verb. We apply a variety of algorithms to this classification problem to determine whether the primary linguistic data is sufficiently rich in this kind of information to enable children to resolve the conundrum, and whether this information can be extracted in a way that reflects distinctive features of child language acquisition. The input consists of counts of how often various verbs occur with animate subjects and eventive predicates in two corpora of naturalistic speech, one adult-directed and the other child-directed. Proportions of the semantic frames are insufficient. A Bayesian attachment model designed for a related language learning task does not work well at all. A hierarchical Bayesian model (HBM) gives significantly better results. We also develop and test a saturating accumulator that can successfully distinguish the three classes of verbs. Since the HBM and saturating accumulator are successful at the classification task using biologically realistic calculations, we conclude that there is sufficient information given subject animacy and predicate eventivity to bootstrap the process of learning the syntax and semantics of these verbs.
ISSN:	1570-7075 1572-8706
DOI:	10.1007/s11168-011-9073-6