Research On Deep Multi-Task Learning Method Based On Attention Mechanism

Posted on:2023-05-02

Degree:Master

Type:Thesis

Country:China

Candidate:Z H Chen

Full Text:PDF

GTID:2568306914972849

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The essence of multi task learning is to obtain a reasonable knowledge sharing structure through the joint training between related tasks in the context of deep learning,so as to express the features effectively.The core of multi task learning is to optimize multiple objective functions at the same time and summarize the feature information extracted from different tasks.In the research process of multi task learning methods,this paper notes the following two problems in this field:first,most of the existing multi task learning methods are predefined network training structures,and this kind of solidified prior knowledge sharing structure between training tasks is only suitable for some specific application scenarios,which has strong limitations;Secondly,in the simple deep learning task scenario,the introduction of too many fusion layers will lead to the disappearance of the deep network gradient.However,there are few methods to solve such problems in this field.Secondly,in the simple deep learning task scenario,the gradient disappearance problem of deep network caused by too many fusion layers is introduced.However,there are few methods to solve this problem in this field.This paper introduces one-way connection on the proposed deep multi task learning method to transfer the shallow features to the deep network,letting the key features of the task express normally.To solve the above problems,this paper first proposes a multi task learning method based on attention mechanism.The innovation of this method is to combine spatial attention and channel attention,so that the attention mechanism no longer focuses on single task,but more on the joint feature expression of multi task.Through the attention mechanism training process based on joint features,knowledge sharing information can be extracted well,Avoid the problem of negative knowledge transfer to the greatest extent and improve the generalization performance of the model.Secondly,based on the proposed deep multi task learning method,this paper introduces one-way connection to transfer the shallow features to the deep network,so that the key features of the task can be expressed normally.Finally,the effectiveness of the proposed method is verified by comparative experiments on three public data sets.

Keywords/Search Tags:

deep multi-task learning, compound convolution, knowledge sharing, attention mechanism, short circuit connection

PDF Full Text Request

Related items

1	Research On Communication Signal Modulation Recognition Based On Deep Learning
2	Research On Gaussian Noise Image Denoising Algorithm Based On Dual Channel Extended Convolution Attention And Residual-dense Block
3	String Recognition Research Based On Deep Learning
4	Short Text Classification Algorithm Based On Temporal Convolution And Attention Mechanism
5	Research On Deep Image Denoising Network With Attention Mechanism
6	Research On Recommendation Algorithm Based On Multi-task Learning And Knowledge Graph
7	Research On Forecasting Loads Of A Regional Integrated Energy System Based On Improved Multi-task Learning
8	Research On Multi-Task Sentiment Classification Algorithm Based On Hard Sharing Mechanism
9	Study On Speaker Recognition Based On Deep Learning
10	Sentiment Analysis Of Social Networks Based On Deep Learning