Word sense disambiguation for statistical machine translation

Posted on:2009-03-05

Degree:Ph.D

Type:Thesis

University:Hong Kong University of Science and Technology (Hong Kong)

Candidate:Carpuat, Marine Jacinthe

Full Text:PDF

GTID:2448390002990566

Subject:Computer Science

Abstract/Summary:

PDF Full Text Request

In this thesis, we show for the first time that lexical semantics modelling is useful in Statistical Machine Translation (SMT).;Word Sense Disambiguation (WSD), the task of resolving sense ambiguity to identify the right translation of a word is one of the major challenges faced by language translation systems. If the English word "drug" translates into French as either "drogue" (used as a narcotic) or "medicament" (used as a medicine), then an English-French machine translation system needs to disambiguate every use of "drug" in order to make the correct translations.;Heavy effort has been put in designing and evaluating dedicated WSD models, in particular with the Senseval series of workshops. This is partly motivated by the often unstated assumption that any full translation system, to achieve full performance, will sooner or later have to incorporate individual WSD components. However, in most machine translation architectures, in particular SMT, the WSD problem is typically not explicitly addressed. This paradoxical situation encouraged speculation that recent progress in SMT shows that SMT models are already very good at WSD and that current WSD systems have nothing to offer to state-of-the-art SMT.;Going beyond these untested assumptions and speculative claims, we conduct the first direct extensive empirical study of the strengths and weaknesses of WSD and SMT. Using the state-of-the-art HKUST WSD system, we surprisingly show that incorporating WSD predictions in SMT does not help translation quality. Puzzlingly, we also report results suggesting that typical SMT models cannot disambiguate word translations as well as dedicated WSD systems.;These seemingly contradictory results lead us to generalize conventional WSD models to incorporate assumptions at least as strong as in state-of-the-art SMT. Specifically, (1) WSD targets are generalized from words to phrases, (2) WSD sense inventories and annotation are learned automatically in the same way as conventional SMT translation lexicons, and (3) WSD models are fully integrated in SMT decoding.;Remarkably, the resulting generalized Phrase Sense Disambiguation (PSD) models improve translation quality across four different Chinese-to-English translation tasks, as measured by eight common automatic evaluation metrics. Further analysis reveals that generalization from conventional WSD to PSD is necessary in order to obtain consistent improvements in translation quality.

Keywords/Search Tags:

Translation, WSD, SMT, Word, Sense disambiguation

PDF Full Text Request

Related items

1	The Application Research Of Word Sense Disambiguation In The Statistical Machine Translation
2	Chinese Word Sense Disambiguation Based On Semantic
3	Word sense disambiguation for statistical machine translation
4	Research Of Text Processing Method And Application Based On Attention Mechanism And Word Sense Disambiguation
5	Research On Word Sense Disambiguation Based On GCN Model
6	Research On Word Sense Disambiguation Based On DBN
7	Research Of Word Sense Disambiguation Based On Word-sense Category Extending
8	Research On Chinese Word Sense Disambiguation Method Based On Deep Learning
9	Research Of Word Sense Disambiguation Based On Hybird Features And Rules
10	A Study Of Chinese Word Sense Disambiguation Based On Hownet