Research On Multi-Modal Cyberbullying Detection Based On Deep Learning

Posted on:2024-06-24

Degree:Master

Type:Thesis

Country:China

Candidate:Z Y Feng

Full Text:PDF

GTID:2568307109955249

Subject:Cyberspace security

Abstract/Summary:

PDF Full Text Request

Cyberbullying refers to malicious,offensive,or insulting behavior against individuals or groups on the Internet.This behavior includes,but is not limited to,the use of words or images to harass,threaten,insult,and spread rumors,causing mental harm to the victim,and even causing property damage and personal image damage.With the widespread use of social media on the Internet,cyberbullying has become a global social problem.In recent years,there has been a gradual increase in the use of voice for cyberbullying,which causes greater harm to the bullied than using text or images.Therefore,developing a multimodal fusion network model that integrates text and speech features to detect cyberbullying content has extremely important application value.This model can effectively analyze and identify the characteristics of voice and text in cyberbullying,and better protect the rights and interests of victims.Considering that cyberbullying security comes from multimodal forms of information sources,this article aims to conduct research from two aspects: natural language processing algorithms and multimodal fusion networks,in order to explore effective methods for cyberbullying detection.Specifically,the research work of this article is as follows:(1)A text detection model based on hierarchical attention network(BHF)is proposed.Using the pre training model BERT to output the basic features of the text,key information is extracted from both word level and sentence level dimensions through a hierarchical attention mechanism,further improving the feature extraction ability of the model and obtaining deeper semantic features;At the same time,a fusion mechanism is introduced to adjust the feature distribution of the model output,fuse basic features and deep features,and obtain a semantic representation that integrates three dimensions of words,sentences,and full text,enhancing the learning ability of the model.(2)A multimodal fusion cyberbullying detection model based on spatial representation(MFNSF)is proposed.The model constructs shared space and specific space,respectively mining shared and specific features between different modes,and constrains the mapping direction of the space through shared loss and specific loss.In addition,the improved attention network is used to obtain modal fusion features and input them as final features into the cyberbullying model for detection and classification.Comparing the MFNSF model with mainstream multimodal fusion models,the experimental results show that the MFNSF model outperforms other methods in three different multimodal data sets,CMCAD,COLD,and CMU-MOSI.

Keywords/Search Tags:

Cyberbullying, Multimodal representation learning, Attention mechanism, Pre-trained language model, Text classification

PDF Full Text Request

Related items

1	Unified Vision-Language Representation Learning For Multimodal AI
2	Research On Chinese Short Text Classification Based On Pre-trained Language Model
3	Research On Multi-label Text Classification Based On BERT
4	Research On Deep Learning Text Classification Method Based On BERT Model
5	Research On Sentiment Analysis Of Self-attention Mechanism Based On Pre-trained Language Model
6	Research On Chinese Text Summary Generation Based On Pre-trained Language Model
7	Research On Chinese Text Classification Algorithm Based On Deep Learning
8	Study On Cyberbullying Detection Driven By Multimodal Data
9	Research On Multimodal Sentiment Analysis Based On Joint Learning Of Image-text Features
10	Research On Text Representation Optimization Method Based On Pre-Trained Models