Font Size: a A A

The Development Of The Hot Issues News Corpus And Word Study

Posted on:2013-09-06Degree:MasterType:Thesis
Country:ChinaCandidate:Z E MaFull Text:PDF
GTID:2245330395952634Subject:Linguistics and Applied Linguistics
Abstract/Summary:PDF Full Text Request
The domestic researcher of news language has achieved certain results, the papers and monographs about new language have been published, but most of starting point of the researchers are about writing and rhetoric, discussing how to adapt to the requirement about news writing, how to strengthen the linguistics expressive effects, while little things are about the news language base on the social hot events in news corpus.This study researches news language from the perspective of linguistics, using linguistics theory. Firstly, review the modem Chinese register information corpus. Completed daily、legal、business、sports register information corpus provide the first information for the language study, from different registers, base on the corpus of branch register language researches obtained the certain result, the study is news part of the information corpus. Secondly, establish a "social hot events in news corpus" The study includes all the hot news of "Yangzi Evening News," in the year2009, according to the screening criteria, the ultimately number of the selected hot events is489,000, The70%of them are PDF form, we need to use OCR software to convert them into word form, the process of conversion we need to proofread, in order to ensure the correctness of data. To facilitate the future search, proofreading, we need to classify and code the data of the corpus in order to facilitate the future search, proofreading. The register news corpus contains33pieces of hot events, there are365files in the corpus. There is a code in news, along with the title of news, the time, reporters, layout and word count of the report. To determine the properties and the development principles of the corpus, process the corpus according to the development procedures of the corpus. This study adopts to these language data, and the correct by a manual. Discuss these problems which appears in the process, to fit the linguistic characteristics of hot news register, prepare for the corpus-based research, and ultimately to establish a hot news register corpus which is signed. Finally, to make full use of the "social hot events in news corpus". Do word frequency statistics base on the hot events in news register corpus, to make <hot events news vocabulary frequency table> and <hot events news basic vocabulary table>. Compare the hot events news vocabulary (select high-frequency words, second-class high-frequency words, a part of intermediate frequency words) with the general vocabulary, and then extracted216special words. Reference the semantic, classify them according to the sequence of the news events reports. Return to the corpus to search for these special words, according to hot news features, Classify them:"time","event description","network promotion","media intervention","judicature intervention","event affection" six categories. The classification of the special words is not in view of the subjective determination, but the corpus, the distribution of the special words in the corpus to determine their categories, comb out the trigger continuous mode of the hot news on the basis of the classification.The study combine the quantitative and qualitative methods. The "social hot events in news corpus" and the <hot events news basic vocabulary table>, offer reference for the teaching of news subject, the compiling of news dictionary, and the development of news linguistics. The hot events trigger-continuous mode has some enlightenment to report news.
Keywords/Search Tags:hot events, news register, corpus, word frequency statistics, trigger-continuous mode
PDF Full Text Request
Related items