Resaerch And Implementation Of Image Captioning Algorithm With High-level Semantics Based On Deep Learning

Posted on:2022-02-18

Degree:Master

Type:Thesis

Country:China

Candidate:C C Lu

Full Text:PDF

GTID:2518306338468924

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

With the collaborative development of Internet technology,a great plenty of information stored on the social media and network.Among them,as an important information medium,image often presents the characteristics of large amount of data,more content and wide coverage.In the face of the large volume and continuous visual information in the network,manually annotation is a heavy workload,high cost and many differences,and it is not conducive to improving the efficiency of image classification and retrieval in the era of big data.Image description generated system allows users to input images and automatically return the natural language description matching the content.This mode helps to save time and labor costs,and realize the automatic understanding of image visual information.It can not only solve the efficiency and performance problems of image retrieval,but also maintain the security of network environment.As deep learning advances,image caption has made great progress in last few years.However,there are still a series of "semantic gap" problems in the existing methods,such as the absence of accuracy,detail and diversity,which means that there is still a huge space for improvement.Aiming at the existing problems in automatic description,the following research results are achieved:(1)For the semantic information deviation of generated description,this paper proposes a caption algorithm based on the fusion of residual attention mechanism and ordered memory module.It can extract visual features efficiently,and take advantage of ordered memory to learn the level information in sentences.Experiments show that,this fusion algorithm can enhance the semantic correctness of description details,and generate a more accurate and better structured description in actual.(2)For the lack of high-level semantic details in generated description,this paper proposes a visual relationship description algorithm based on knowledge map.It presents the graph of visual relationship in image vividly,explores the visual relationship between entities deeply to enrich the description details.Experiments show that,this algorithm can enhance the semantic integrity of description,and generate a more comprehensive and natural description.(3)Based on the above research,this paper proposes an image caption algorithm for high-level semantic based on deep learning.Combined with many advanced mechanisms in deep learning,the above two algorithms are integrated to optimize and improve the image description technology.The purpose is to bridge the "semantic gap" and realize automatic generation of high-quality image description.This paper evaluates the proposed algorithm on large datasets,confirms that it can generate a more accurate,comprehensive,hierarchical,fluent language description.

Keywords/Search Tags:

visual feature, semantic information, natural language description, attention mechanism

PDF Full Text Request

Related items

1	Research On Natural Language Description Generation For Short Video In Self Media
2	Context-Aware Natural Language Semantic Representation Research
3	Research On Obgect Detection Algorithm Based On Natural Language Description
4	Research On Image Description Generation Based On Visual Attention
5	Research On Person Re-identification Algorithm Based On Natural Language Description
6	Research And Implementation Of Visual Question Answering System Based On Collaborative Attention Mechanism
7	Research On The Generation Method Of Chinese Image Description Based On Dual Attention Mechanism
8	Design And Implementation Of Image Natural Language Description Based On Feature Fusion
9	Research On Natural Language Understanding Of Air Travel Based On Joint Modeling
10	Analysis Of Web Service Composition Based On Natural Language Demands Description