Font Size: a A A

Research On Text Analysis And Co-word Occurrence Analysis Of Three Rural Issues Based On Text Mining

Posted on:2019-09-29Degree:MasterType:Thesis
Country:ChinaCandidate:B LiFull Text:PDF
GTID:2429330548982045Subject:Management Science and Engineering
Abstract/Summary:PDF Full Text Request
In China,the report of government work is a literal style of government agencies' work summary and future work plan issued to the whole public.In the report,it involves the distribution of the attention of the decision-makers,the allocation of resources and the testing of the ability to govern.Therefore,the government work report is not only a written expression of the government's will,but also an authoritative material to measure the level of government's governance.With the passage of time,the emphasis and concerned of government work reports in different periods are diverse.Therefore,it is theoretically and logically feasible to study the report of government work on a specified topic.With the continuous enrichment of information crawling and text statistics analysis technology under the text mining technology in recent years,more and more subject fields have gradually begun to apply text mining technology to application research.The theoretical basis of this paper is mainly from text mining and complex network theory.The corpus we use is the annual work report issued by the State Council and provincial local government from the crawler collection.In the process of data analysis and processing,it is mainly used in the knowledge of Natural Language Processing and probability statistics theory.The text statistics and analysis of the government work report can help the users to obtain the related knowledge quickly,construct the word co-occurrence network to obtain the evolution trend of the topic,and use statistical method and data visualization to assist in mining the knowledge behind the text.The study shows that after the analysis of the 50 years report of the State Council on the work report of the State Council and the symbiotic network knowledge in the field of biology,the modularization coefficient,the close centrality and the relationship map of the subject words and the relationship sub map of the subject words are obtained.We find that there is an asymmetry between the components of the three rural issues in China.The pattern of mutualism is that the allocation of resources between the three topics is biased,and the subject of three rural issues should be implemented on the main body of farmers.Then,after dealing with Chinese word segmentation,noise removal and key word statistics,we first found that 1998 is the watershed of the government work report.After that,the length of the report tends to be stable.The length of the report of government work can reflect the political and economic situation of the corresponding time period.After the statistical analysis of the key words of the five time periods,we found the transfer path of the central government's attention at different periods:ideological struggle,development and economic reform.Finally,based on the division of the four economic regions,the text statistics and the word co-occurrence network analysis of four economic regions in the last 11 years are carried out respectively,and the government work reports in the same year are compared.By dividing the cluster,establishing the cluster word library,and analyzing the keyword distribution of the word group,it can provide the reference for the rapid understanding and comparison of different entities,and provide decision assisting for the decision-makers.Finally,according to the study of three rural issues,this paper puts forward corresponding suggestions.
Keywords/Search Tags:text mining, complex network, word co-occurrence, three rural issues, government work report
PDF Full Text Request
Related items