Deep Generative Models for Vision and Language Intelligenc

Posted on:2019-09-17

Degree:Ph.D

Type:Thesis

University:Duke University

Candidate:Gan, Zhe

Full Text:PDF

GTID:2477390017984659

Subject:Artificial Intelligence

Abstract/Summary:

Deep generative models have achieved tremendous success in recent years, with applications in various tasks involving vision and language intelligence. In this dissertation, I will mainly discuss the contributions that I have made in this field during my Ph.D. study. Specifically, the dissertation is divided into two parts.;In the first part, I will mainly focus on one specific kind of deep directed generative model, called Sigmoid Belief Network (SBN). First, I will present a fully Bayesian algorithm for efficient learning and inference of SBN. Second, since the original SBN can be only used for binary image modeling, I will also discuss the generalization of it to model spare count-valued data for topic modeling, and sequential data for motion capture synthesis, music generation and dynamic topic modeling.;In the second part, I will mainly focus on visual captioning ( i.e., image-to-text generation), and conditional image synthesis. Specifically, I will first present Semantic Compositional Network for visual captioning, and emphasize interpretability and controllability revealed in the learning algorithm, via a mixture-of-experts design, and the usage of detected semantic concepts. I will then present Triangle Generative Adversarial Network, which is a general framework that can be used for joint distribution matching and learning the bidirectional mappings between two different domains. We consider the joint modeling of image-label, image-image and image-attribute pairs, with applications in semi-supervised image classification, image-to-image translation and attribute-based image editing.

Keywords/Search Tags:

Generative, Image, Modeling

Related items

1	Research On Unsupervised Image Generation Based On Generative Adversarial Networks
2	On Aesthetic Education And Human Beingâ€™s Image Creation
3	Generative Adversarial Network-based Text Generating Image Research
4	Virtual Image Generation Based On Generative Adversarial Networks
5	Non-convex Sparse Deviation Modeling Via Generative Models
6	Criticism And Reconstruction
7	Study On Cultivation Of Art Creativity Of Primary School Students In Urumqi City By Image Reading Middle Line Modeling Teaching
8	Development And Utilization Of High School Biology Generative Resources Under The New Curriculum
9	Action Research About Generative Teaching In Geography
10	The Practical Study On Generative Teaching Of Chemistry Curriculum In High School