Video is widely used in all aspects of our lives because of the richness, intuitiveness, and vividness of its content. However, with the rapid development of the Internet, the scale of video data has increased dramatically, and analyzing and managing such massive amounts of video manually requires enormous manpower. The Multimedia Event Detection (MED) task has therefore emerged in recent years and become a hot research topic in the fields of computer vision and video retrieval. Deep learning continues to make major breakthroughs in the image domain, providing an effective reference for other areas; however, no mature network structure yet exists for complex video tasks such as MED. In this paper, multimedia event detection based on multi-modal features is explored in detail. Considering the advantages and disadvantages of the existing frameworks, namely the semantic-based and average-frame-based methods, the main work of this paper is as follows:

1. First, combining the strengths of deep learning with traditional feature aggregation methods, CNN features and VLAD encoding are applied to video event detection and achieve good results.

2. Second, given the hierarchical, structural, and complex nature of video multimedia, audio features are extracted experimentally for the multimedia event detection task and combined with visual features as a complement. To address the shortage of training samples for multimedia event detection, an effective feature extraction framework is built.

3. Finally, a multimedia event detection system based on multi-modal features is built and tested on multiple data sets. The system participated in TRECVID 2017 MED and won second place, verifying the effectiveness of the multimedia event detection framework and the algorithms proposed in this paper.
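The CNN-plus-VLAD aggregation named in point 1 can be sketched as follows. This is a minimal illustration, not the system described in the paper: it assumes frame-level CNN descriptors are already extracted and a K-word codebook has been learned (e.g., by k-means), and the helper name `vlad_encode` is hypothetical.

```python
import numpy as np

def vlad_encode(descriptors, codebook):
    """Aggregate per-frame descriptors (N x D) against a K x D codebook
    into a single K*D VLAD vector (illustrative sketch only)."""
    # Assign each frame descriptor to its nearest codeword
    dists = np.linalg.norm(
        descriptors[:, None, :] - codebook[None, :, :], axis=2
    )
    assignments = np.argmin(dists, axis=1)

    K, D = codebook.shape
    vlad = np.zeros((K, D))
    for k in range(K):
        members = descriptors[assignments == k]
        if len(members) > 0:
            # Accumulate residuals between descriptors and their codeword
            vlad[k] = (members - codebook[k]).sum(axis=0)

    vlad = vlad.flatten()
    # Signed square-root (power) normalization, standard for VLAD
    vlad = np.sign(vlad) * np.sqrt(np.abs(vlad))
    # Final L2 normalization
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad
```

A video is thus represented by one fixed-length vector regardless of its frame count, which can then be fed to a standard classifier for event detection.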