Research On Adversarial Examples Defense Algorithm For Sentiment Classification

Posted on:2024-01-23

Degree:Master

Type:Thesis

Country:China

Candidate:S H Dai

Full Text:PDF

GTID:2568307181954419

Subject:Electronic Information (in the field of computer technology) (professional degree)

Abstract/Summary:

Deep neural networks have become a core technology in many fields of artificial intelligence. However, deep neural network models suffer from poor security and robustness: attackers can use malicious adversarial samples to cause a model to make wrong predictions, which can lead to serious consequences. Sentiment classification is an emerging topic in the field of text data mining, but current methods of deducing sentence polarity have two major shortcomings. On the one hand, there is currently a lack of a large and well-curated corpus; on the other hand, current solutions based on deep learning are particularly vulnerable to attacks from adversarial samples. In view of these problems, the main work of this thesis is as follows:

① This thesis proposes an adversarial sample defense algorithm for target-specific sentiment classification based on word-masking data enhancement and adversarial learning (WMDE-AL). Firstly, target-specific entities are masked, and synonym replacement and random word insertion are applied to enhance the data. Secondly, the word-masking-enhanced target-specific sentiment classification dataset is used to train the corresponding sentiment classification model. Finally, data enhancement and adversarial learning are combined to construct a target-specific sentiment classification model. The resulting algorithm has stronger robustness and higher accuracy.

② This thesis proposes a new adversarial training method for sentiment classification, an adversarial sample defense algorithm that combines a hierarchical neural network with gradient reversal (HNN-GRAT). Firstly, the baseline model is used to extract text features and feature gradient information. Secondly, gradient reversal is applied to the original gradient information to obtain the inverted gradient information. Finally, the original gradient information and the inverted gradient information are fused to obtain a new gradient of the model for adversarial training. The hierarchical neural network and gradient reversal adversarial training algorithm improves the robustness and accuracy of sentiment classification and reduces the probability of the model being successfully attacked by adversarial samples.

In summary, this thesis proposes two defense algorithms against adversarial samples for sentiment classification, based on different text attack methods and different sentiment classification baseline models and combined with the principle of adversarial training. The experimental results show that, compared with the six baselines (SC), the WMDE-AL algorithm improves the Macro-F1 index by 0.90% to 2.64%, 1.59% to 3.09%, and 0.18% to 1.71%, respectively. For sentence-level adversarial sample defense, the proposed HNN-GRAT method was compared against six text adversarial sample defense methods, and it obtained the best Boa and Succ values. (For the DeepWordBug attack on the AGNEWS, IMDB, and SST-2 datasets, Boa was 41.50%, 67.50%, and 28.15%, and Succ was 55.90%, 27.45%, and 69.89%, respectively.) Therefore, the proposed WMDE-AL and HNN-GRAT algorithms effectively improve the robustness and adversarial defense ability of target-specific and sentence-level sentiment classification models.
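
The word-masking step described in ① can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the abstract does not specify the mask token, the synonym source, the inserted words, or whether the mask is later restored, so the `SYNONYMS` table, the `FILLERS` list, the `[MASK]` token, and all function names below are hypothetical. The sketch assumes that the target entity is masked first so that the later perturbations cannot alter it.

```python
import random

# Hypothetical toy resources; the abstract does not name the actual ones.
SYNONYMS = {"good": ["great", "fine"], "bad": ["poor", "awful"], "slow": ["sluggish"]}
FILLERS = ["really", "quite", "very"]
MASK = "[MASK]"  # assumed mask token


def mask_target(tokens, target_words):
    """Replace every token that belongs to the target entity with MASK."""
    return [MASK if t in target_words else t for t in tokens]


def replace_synonyms(tokens, rng, p=0.3):
    """Swap each token that has a known synonym with probability p."""
    return [
        rng.choice(SYNONYMS[t]) if t in SYNONYMS and rng.random() < p else t
        for t in tokens
    ]


def insert_random_words(tokens, rng, n=1):
    """Insert n filler words at random positions."""
    out = list(tokens)
    for _ in range(n):
        out.insert(rng.randint(0, len(out)), rng.choice(FILLERS))
    return out


def augment(tokens, target_words, seed=0):
    """Mask the target, then perturb the remaining context."""
    rng = random.Random(seed)
    masked = mask_target(tokens, target_words)
    return insert_random_words(replace_synonyms(masked, rng), rng)


if __name__ == "__main__":
    sentence = ["the", "battery", "is", "good", "but", "slow"]
    print(augment(sentence, {"battery"}, seed=1))
```

Because the mask token has no entry in the synonym table, the target position survives both perturbations unchanged, which is the property this sketch is meant to show.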
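
The Macro-F1 index used to report the WMDE-AL results is the unweighted mean of the per-class F1 scores. The sketch below shows the standard definition in plain Python; the label names in the example are hypothetical, and the thesis may compute the metric with a library instead.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the denominator is 0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    y_true = ["pos", "neg", "neu", "pos"]
    y_pred = ["pos", "neg", "pos", "pos"]
    print(macro_f1(y_true, y_pred))
```

Because every class contributes equally, a class that is never predicted correctly pulls the score down regardless of how rare it is, which is why the metric is common for imbalanced polarity labels.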

Keywords/Search Tags:

Sentiment classification, adversarial training, gradient reversal, adversarial samples, robustness

Related items

1	Research On Generating Transferable Adversarial Samples And Enhancing Adversarial Robustness Methods
2	Research On Adversarial Sample Generation And Defense Methods For Text Classification
3	Research On Text Sentiment Analysis Based On Adversarial Training
4	Research On The Robustness Of Deep Image Classification Models Based On Adversarial Examples
5	Research On Adversarial Sample Defense Method Based On Image Classification
6	Research On Attack And Defense Algorithms Of Adversarial Samples Based On GAN
7	Robustness Of Machine Learning Models Based On Adversarial Examples
8	Research On Adversarial Training Methods Of Deep Neural Networks
9	Research On Adversarial Defense Robustness Of Deep Model
10	Hypersphere Embedded Adversarial Training In Image Recognition