Hot Topic Discovery And Polarity Analysis Of Microblog Based On Keywords

Posted on:2023-03-13

Degree:Master

Type:Thesis

Country:China

Candidate:J H Lin

Full Text:PDF

GTID:2557307103968909

Subject:Applied Statistics

Abstract/Summary:

PDF Full Text Request

Microblog has become the main community for Internet users to share short and real-time information on the Internet.The multi-source heterogeneous information representation and the extremely low threshold for entry and communication mean that it has become the core site for information dissemination and public sentiment fermentation.Therefore,the research on microblog information flow becomes very necessary.This study mainly proposes two tasks:(1)By introducing the concept of head word,this paper proposes an improved LDA topic model in order to obtain better topic distribution effect in microblog hot topic discovery.The research is based on microblog hot data,firstly,text representation learning was carried out.The models based on Bert and Word2 Vec were set as the experimental groups,and the models based on TF-IDF and BOW were set as the control groups.Finally,the experimental groups and the control groups generated improved LDA models and traditional LDA models respectively.By comparing the traditional LDA model with the improved LDA model,it is found that the LDA model generated by the improved method is better than generated by the traditional method in terms of the distribution concentration of high-frequency words,and is more suitable for the generation of hot topics in downstream task applications.(2)On the basis of hot topic generation,this paper proposes an improved sentiment analysis model based on ABSA,and obtains the sentiment polarity distribution of each type of topic.When giving different weights to local context features,the research uses the semantic distance-based weight decay method SCDW to replace the location based decay method CDW in the original methodology,in order to balance the high risk of CDM and the weak effect of CDW,and obtain an efficient and stable model.By setting LSA-S-ME,LSA-S-DE in the control group and SLCF in the experimental group,the experiment finally found that the AUC effect of SLCF was close to CDM at the peak and better than CDW on the whole.In addition,the research also proposes a probability calibration work under the polarity binary classification,which realizes the quantification of the polar probability,and calculates the polarity value of the text based on the keywords.Finally,for a large number of microblog data,after data cleaning,word segmentation and stop words,and text representation learning,under two consecutive jobs,this research finally refined and generated a corpus based hot topic classification and corresponding sentimental polarity distribution.

Keywords/Search Tags:

microblog hot topics, LDA, BERT, keywords, Sentiment polarity analysis, ABSA

PDF Full Text Request

Related items

1	Research On Emotional Analysis Of Hot Events Based On Microblog Text
2	Research On Text Sentiment Analysis Based On BERT-BIGRU Model
3	Research On The Influencing Factors Campus During The COVID-19
4	Public Opinion Analysis Algorithm Based On CNN-BILSTM Network And BERT
5	Learning Behavior And Sentiment Analysis In MOOCs
6	Research On Microblog Sentiment Analysis Based On Social Relations Among Users
7	Sentiment Analysis Of Product Reviews Based On Text Mining
8	Research And Application Of Sentiment Analysis For Course Reviews
9	Research On Video Barrage Text Based On Sentiment Analysis
10	Based On BERT-BiLSTM-CNN Multi-feature Fusion Research And Application In Public Opinion Analysis Of Three-child Policy