Corpus-based learning for pronominal anaphora resolution

Posted on:2006-04-24

Degree:M.Sc

Type:Thesis

University:University of Alberta (Canada)

Candidate:Bergsma, Shane Anthony

Full Text:PDF

GTID:2455390008969274

Subject:Computer Science

Abstract/Summary:

PDF Full Text Request

Anaphora resolution is a challenging and important problem in Natural Language Processing. We use machine learning from both labelled and unlabelled corpora to gather probabilistic information for reference resolution. Our unsupervised, unlabelled textual extraction approaches are a form of "bootstrapping" for information extraction. By assuming coreference links in unlabelled text, we can infer statistically meaningful information to assist coreference determination. This includes information on a noun's gender and number, its frequency as an antecedent, and the likelihood of coreference occurring between entities along a given syntactic relationship. These new sources of information are combined with well known constraints and preferences by inducing classifiers using supervised learning.

Keywords/Search Tags:

Information

PDF Full Text Request

Related items

1	Information Classification And Information Value
2	Information Processing Strategies Of Political Interviews
3	Information Scavenging: A Creative Design Study On Network Information Garbage Disposa
4	On The Of Internal Contradictions Of Information Capitalism And Its Development Path
5	Information Processing In The Summing Up Of Issues In Chinese Civil Court Hearings: A Discourse Information Perspective
6	Research On The Information Integration Of Large Amounts Of Information Design
7	The Generationand Interpretation Of Information Chart Design
8	The Current Human Condition And Ethics In Information Technology
9	The Historical Evolution Research Of Information Interaction Design Mode
10	Case Study Of Translating CES Project