Research On Text Classification Model Based On Formal Concept Analysis

Posted on:2018-02-14

Degree:Master

Type:Thesis

Country:China

Candidate:L Q Yan

Full Text:PDF

GTID:2348330533963018

Subject:Electronic and communication engineering

Abstract/Summary:

PDF Full Text Request

With the rapid development of Internet,the network will produce massive daily text data,how to extract useful information from these massive text data become the hotspot of today’s research.Text classification is an important part of data mining technology,which facilitates the efficient storage and mining of massive text information.Therefore,research has important value and significance.Firstly,based on the study of the general text classification model,a text classification model based on formal concept analysis is proposed for the current text classifier in the case of less training text set.The model divides the attribute characteristics of the text,forms the background of the form,constructs the concept lattice,and classifies the classification rules extracted from each concept in the concept lattice as the rules of text classification.Secondly,for the algorithm of concept lattice classification rule extraction,this paper presents an improved algorithm for extracting classification rules.The algorithm calculates the weight of each attribute in each classification rule,and converts the extracted classification rules into the sum of the weights of the attributes.The algorithm can extract more classification rules,can be better to avoid the classification of the classification rules are too few and can’t determine the situation.In addition,in the prediction,the determination of the sum of the attribute weights is more convenient than the previous classification rule,and can effectively reduce the spatial and temporal complexity of the judgment.Finally,this paper uses the method of chi-square verification as feature selection method in text preprocessing,and combines the text categorization model given in this paper to develop text categorization software based on formal concept analysis.In the demonstration model construction process,as the experimental platform,the use of open data sets: the calculation of precision,recall and F value of the three indicators,conducted a number of experimental comparison.The experimental results show that,in the case of relatively small text training set,the proposed model can also get a better classification effect,compared with the traditional text classifier due to over-fitting caused by poor classification of the situation has improved significantly.

Keywords/Search Tags:

Text Classification, Formal Concept Analysis, Concept Lattice

PDF Full Text Request

Related items

1	Research On Text Classification Model Based On Formal Concept Analysis
2	Formal Study On Key Issues In Classification Rule Mining Based On Formal Concept Analysis
3	The Research Of Concept Lattice Pruning Method And Its Application In Web Mining
4	Concept Lattice Generation Based On Attributes Classification
5	On The Characterization And Generation Of Three-way Concept Lattices
6	Study On The Unlabeled Text Mining Methods Based On The Concept Lattice Extension Models
7	Complementary Concepts And Their Properties And Generation In Concept Lattice
8	Building,Merging And Presenting Of Ontology Based On Formal Concept Analysis
9	The Research Of Chinese Web Page Classification Based On Formal Concept Analysis
10	Research On Data Mining Based On Distributed Concept Lattice Model