Font Size: a A A

Microblogging Language Research

Posted on:2015-07-08Degree:MasterType:Thesis
Country:ChinaCandidate:H W YangFull Text:PDF
GTID:2175330431457370Subject:Chinese Philology
Abstract/Summary:PDF Full Text Request
In this paper, the quantitative research method, exploring people in micro-blog words in sentence, a text rules. From the corpus, real live mass, dynamic, the number of each language feature extraction unit, to observe the micro-blog word, sentence, paragraph, article style, full range, multi angle, to clarify micro-blog language features. The introduction part first uses the bibliometric method, carries on the summary to the present research micro-blog language; and then to high frequency keywords in the literature as the breakthrough point, a review of some representative papers; finally, the significance of this research, methods and innovations are described. The first chapter focuses on the "establishment of micro-blog database", first tells corpus selection, processing and processing, the establishment of the9elements of information input principle and construction, establish "micro-blog table". Secondly, based on the "micro-blog summary", build "list","table" clause, sentence "table" and "dependency treebank," lay the foundation for measurement of discourse, behind the words, sentences, clauses and style. Analysis of quantitative statistics and data of second chapter structure factors to micro-blog discourse. First briefly introduces discourse roles and communication carrier on micro-blog discourse, and quantitative analysis on the relationship between the elements micro-blog discourse structure. Secondly, an analysis of language features micro-blog from macroscopic and the microscopic two angle. Finally, based on the statistical distribution of micro-blog topic length, punctuation, sentence and clause distribution distribution analysis, summed up the micro-blog topic in the form and content features. The third chapter on micro-blog vocabulary, from the macro, micro two angles and the characteristics of the sentence. First of all, through all kinds of comprehensive data investigation about the "vocabulary" statistical", from the macro micro-blog Thesaurus" in terms of overall appearance, structure and specific characteristics. Secondly, the "cumulative coverage rate" calculation, analysis results and the artificial synthesis of some representative words based on word,"social games","psychological field","new media" partition, the micro-blog characteristic markers from the micro perspective, and analyze how they are attached to the society, characterization of psychology and adaptive micro-blog itself. Once again, the number of the number of clauses, clause, micro-blog length of three aspects of a comprehensive consideration of micro-blog sentences in the sentence, to grasp the characteristics and the distribution of the micro-blog macroscopically. Finally, in order to "high frequency" principle as the standard, marker sentences, typical sentence using computer query and V can reflect the general characteristics of micro-blog language this discourse type, provide a paradigm micro-blog sentence from the micro. In the organic link in this paradigm, systematic exploration of the main content of micro-blog characteristics, summarizes the partial structure types of main content. The fourth chapter to micro-blog language as the main body, to the blog language reference, through the metering characteristics used in comparative micro-blog and blog style of distinctive features in the writing style, summarized two. Secondly, from the language style of the point of view of language standardization. In the conclusion part, summarizes the research conclusion, the unsolved problems and further research plan.
Keywords/Search Tags:micro-blog, database, discourse, vocabulary, sentence, style
PDF Full Text Request
Related items