New Chinese Words Assisted Identification System Developed

Posted on:2004-08-31

Degree:Master

Type:Thesis

Country:China

Candidate:B Luo

Full Text:PDF

GTID:2205360122971963

Subject:Linguistics and Applied Linguistics

Abstract/Summary:

PDF Full Text Request

There are two basic methods in automatic recognition of unregistered words: statistic-based and linguistic rule-based. Linguists used to interpret the rules of word formation from the perspective of impression, which is hard to offer formalized conclusions, so it is quite difficult to gain computer application. This paper tries to describe the rules of word-building in a relatively quantitative way and makes the conclusion more computational applicable.This paper introduces the development on "Computer-aided Unregistered Words Identification System in Contemporary Chinese", and gives a particular description of the system, including its structrue, algorithm and process. Also, it analyses the recall rate and precision rate of the test result.In our developing process of the system , we combine statistic-based and linguistic rule-based to enable computers to extract possible unregistered words from large running-texts automatically, thus providing modern Chinese dictionary editors with a wait-and-see unregistered word list to support their work on new edition of the dictionary. It will give a sheet with unregistered words to be identified manually by modern Chinese dictionary editors. Also, this system can be used to identify unregistered words in Chinese information processing.Another characteristic of this system is that we based our running text on the People's Daily (electronic edition), which contains about 70,000,000 Chinese characters, and the test results are reasonable.

Keywords/Search Tags:

word-building, Unregistered words, Chinese information processing.

PDF Full Text Request

Related items

1	Research On The Evolution Of Old Words And New Meanings For Chinese Information Processing
2	The Processing Of Subjective Words In Chinese Reading
3	The Construction Of Chinese Morpheme Words Knowledge Base And Its Application In Understanding Unregistered Words
4	The Research On Exploring And Practicing The Advantage Information Of The Phonetic Word
5	The Time Course Of Orthographical And Phonological Processing In Word Recognition Of Chinese Single-character Words With High Frequency:An Eye Movement Study
6	The Effect Of Different Types Of Perceptual Information On Chinese EFL Learners’ Processing Of English And Chinese Object Words
7	Research On The Chinese New Words Developed Since The Reform And Opening
8	The Bottom-up Processing Order Of Word Category Information And Word Meaning Information: Evidence From An ERP Study In Chinese
9	Analysis And Study Of The Characteristics Of Chinese Three-part Causative Complexes Based On Relational Word Collocation
10	Word Segmentations And Words Processing Style In Chinese Reading