Research On Online Learing Based Microblog Filtering

Posted on:2014-07-22

Degree:Master

Type:Thesis

Country:China

Candidate:F H Ceng

Full Text:PDF

GTID:2268330425465999

Subject:Computer applications and technology

Abstract/Summary:

PDF Full Text Request

With the popularity of the Internet and the development of information technology, thenumber of microblog users grows rapidly, microblog data has an explosive increase. Whenlogging in, users usually face a lot of updates so that they have difficulty to get theirinteresting blog. Microblog filtering technology has already became an important part ofmicroblog service. The microblog filtering technology mainly solve two problems---providemicroblog and related information users have interest in and filter garbage microbloginformation, such as reactionary information pornography, violence, and advertising.Usersâ€™ interesting changes as time changes, the traditional batch learning cannot adaptto the update of user interest model. But machine learning based on online learning cansolve these questions. This paper mainly contents these parts as follows:Firstly, I research microblogâ€™s overall framework which includes microblog featureextraction, microblog feature selection, computation of microblog feature weight andfiltering based on machine learning. This paper descriped some machine learning in detail,like logistic regression algorithm, support vector machine algorithm, K-nearest neighboralgorithm and Naive Bayes algorithm, and also analyaed their advantages anddisadvantages.Secondly, I researched microblog filtering technology framework and microblogfiltering. And I focused on microblog filtering based on the online logistic regression modeland the online support vector machine and compared these two metodsâ€™ strengths andweaknesse through time complexity and performance of microblog filtering.Thirdly, I research microblog filtering method based on improved online supportvector machine model. Online support vector machine filter outperforms the logisticregression model, but there is a long time to run the shortcomings. the paper by reducing thesize of the training set, reducing the number of training and reducing the number ofiterations are three ways to enhance the online support vector machine filter the control ofthe time spent. Proved through experiments while filtering performance fluctuate slightly,but compared to the advantage of the efficiency can almost be ignored meter, and when the larger amount of data, and the efficiency of the more obvious advantages.Lastï¼Œmicroblog filtering has been researched based on feedback learning. Users willhave feedback information when they browse microblogs, like commenting, forwarding andcollecting. We can get the information about usersâ€™ interests and then we can classifymicroblogs. With the experimental results, we can know the feedback learning can improvethe performance of microblog filtering.

Keywords/Search Tags:

microblog filtering, feature extraction, online learning, feedback learning

PDF Full Text Request

Related items

1	Research On Online Learning Based Spam Filtering
2	Research On Spam Filtering Based On Social Network
3	Assessment Of Online Learning Effectiveness By Integrating Learner Implicit Feedback
4	Research Of Online Multiclass Learning With "Bandit" Feedback Under A Confidence-weighted Approach
5	Research On Rumor Feature Extraction And Recognition Algorithms On Microblog
6	Feedback Incremental Learning Algorithm And Its Application In Network Information Filtering Research
7	Research And Implementation Of Pedestrian Detection Algorithm Based On Feature Fusion And Online Learning
8	Research On Text Filtering System Based On Active Learning
9	Research On Time Based Microblog Search And Filtering
10	Research On The Feature Extraction Of Microblog Rumors And The Method Of Rumor Recognition Based On Multi-model Fusion Strategy