Research On News Text Classification Based On Deep Learning

Posted on:2024-08-29

Degree:Master

Type:Thesis

Country:China

Candidate:D P Lin

Full Text:PDF

GTID:2568306920986149

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

With the rapid development of deep learning,this technology has brought new breakthroughs for text classification,among which the BERT model-based text data processing method has become the mainstream.This method first carries out pre-training in large-scale corpus and then can be fine-adjusted according to different downstream tasks.Compared with traditional deep learning methods,this kind of method has better performance and portability.Considering that Chinese mainly expresses semantics through words,the word mask task based on word granularity in the original BERT model can not relate the context well.Through experiments,this thesis compares the text classification effect of BERT model based on whole word MASK,original BERT model and other common deep learning models.The experimental results show that BERT model based on whole word MASK performs better than other models.Therefore,this thesis focuses on the optimization of BERT model based on the whole word MASK.Most of the text classification methods only focus on the in-depth study of a single model,and each single model has advantages and disadvantages,unable to capture the global semantic features and local semantic features at the same time,and the deepening of the depth of the network,easy to cause semantic loss.Therefore,a fusion model is proposed in this thesis.Based on BERT model which adopts full-word MASK sample generation strategy,it integrates the advantages of CNN and Bi GRU in text modeling to obtain more comprehensive semantic features for text classification.Firstly,the original6 Transformer layers of BERT model are removed,and the feature representation of the text is obtained through the BERT model.Then,the local semantic features are extracted by CNN and the global semantic features are extracted by Bi GRU.Finally,the model uses the feature fusion vectors of the two channels for text classification.In order to improve the quality of news text corpus and prepare for improved model comparison experiments,this thesis also builds a small Net Ease news data set through web crawler technology.The final experimental results show that the improved fusion model achieves higher accuracy without increasing the number of parameters,which proves the effectiveness of the fusion model in the task of news text classification.

Keywords/Search Tags:

News text classification, Deep learning, BERT, Whole Word Masking, Feature fusion

PDF Full Text Request

Related items

1	Research On News Text Classification Based On Deep Learning
2	Research On News Text Classification Method Based On Deep Learning And Multi-feature Fusion
3	Research And Implementation Of Multi Model Fake News Classification System Based On Bert
4	Research And Application Of News Text Classification Based On Deep Learning
5	Research On Bad Microblog Text Classification Based On Deep Learning
6	Classification Of News Short Text Based On Deep Learning
7	A Subject Classification To News Text Data Based On BERT Pre-training Model And VAE Feature Reconstruction
8	Research On Text Multi-Feature Classification Algorithm Based On BERT-LSTM
9	Research On Long Text Classification Method Of News Based On BERT And CNN
10	A Research Of Text Sentiment Classification Based On Deep Learning