Linguistically Motivated Combinatory Categorial Grammar Induction

Posted on: 2017-02-02

Degree: Ph.D.

Type: Thesis

University: University of Washington

Candidate: Wang, Adrienne X

GTID: 2445390005471615

Subject: Computer Science

Abstract/Summary:

Combinatory Categorial Grammar (CCG) is a widely studied grammar formalism that has been used in a variety of NLP applications, e.g., semantic parsing and machine translation. One key challenge in building effective CCG parsers is the lack of labeled training data, which is expensive to produce manually. Instead, researchers have developed automated approaches for inducing the grammars. These algorithms learn lexical entries that define the syntax and semantics of individual words, and probabilistic models that rank the set of possible parses for each sentence. Various types of universal or language-specific prior knowledge and supervision signals can be exploited to prune the grammar search space and constrain parameter estimation.

In this thesis, we introduce new methods for inducing linguistically motivated grammars that generalize well from small amounts of labeled training data. We first present a CCG grammar induction scheme for semantic parsing, where the grammar is restricted by modeling a wide range of linguistic constructions. We then introduce a new lexical generalization model that abstracts over systematic morphological, syntactic, and semantic variations in languages. Finally, we describe a weakly supervised approach for inducing broad-scale CCG syntactic structures for multiple languages. Such approaches would have the greatest utility for low-resource languages, as well as domains where it is prohibitively expensive to gather sufficient amounts of training data.

Keywords/Search Tags:

Grammar, CCG, Training data
