Humans use rich natural language to describe and communicate visual content such as videos and images. In this thesis we employ a two-step approach to automatically generate natural language descriptions for videos. In the first step, a rich semantic representation of the visual content, including e.g. activities and objects, is predicted. In the second step, we approach the generation from the predicted semantic representation as a statistical machine translation problem: the semantic representation is treated as the source language and the natural language description as the target language. We learn the translation model from a parallel corpus, namely TACoS [1], which consists of video snippets, low-level annotations, and corresponding natural language descriptions. We also apply word lattice decoding to deal with the uncertainty in the predicted semantic representations. Both automatic evaluation, i.e. BLEU, and human judgments show that our approach improves significantly over several baseline systems inspired by related work. Our translation approach also shows improvements over related work on an image description task.