Font Size: a A A

In View Of The Short Carrier Natural Language Text Information Hiding Technology Research And Implementation

Posted on:2013-05-07Degree:MasterType:Thesis
Country:ChinaCandidate:Y X ZhangFull Text:PDF
GTID:2248330374972215Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Information hiding technique is an effective means of copyright protection of digital products. While text is treated as the most widely used data type, the text information hiding technology has been widespread concerned and in-depth researched. Especially, in recent years, the natural language information hiding became a hot topic of the text information hiding technology. With the rise of micro-blogs, forums, and a variety of review sites, a large number of short texts need copyright protection. But, faced with short text as a carrier, the existing text information hiding algorithms are insufficient of hiding capacity. Therefore, in this paper, the research of short text information hiding algorithm has been studied in the direction of natural language text information hiding technology.In order to solve the problem of insufficient capacity, the short text natural language information hiding algorithm has been proposed in this paper. Based on the application environment and the characteristics of the texts, the algorithm makes use of the redundant space to improve the hiding capacity as far as possible. The algorithm combines the synonym replacement information hiding technique with the changing function words number information hiding technique, and its encoding method is mixed digits encoding. On the one hand, in order to increase the number of words which can be replaced to its synonym, the thesaurus has been build according to the areas of the texts; on the other hand, by changing the number of function words, the secret information which cannot be embedded by the synonym replacement hiding method, can be embedded in the texts. This information hiding algorithm improves the hiding capacity by these two ways. At the same time, with the mixed digits encoding method, it makes full use of each one embedded unit, to express more bits of secret information. The hidden capacity of the algorithm has improved significantly. And this algorithm is suitable for network short text information hidden.In order to safeguard the concealment nature of the algorithm, a word appropriate value calculation method has been proposed in this paper. By the statistical methods, a more appropriate synonym can be chosen. And ensure the replacement word will not destroy the original meaning and readability of the sentence.Finally, a short text information hiding system has been designed and implemented. Experimental data which is obtained in a large corpus, can verify that the algorithm has better concealment and the hiding capacity has been markedly improved.
Keywords/Search Tags:Text Information Hiding, Natural Language Processing, Short Text, SynonymsSubstitution, Add or Delete Function Words, Mixed Digits
PDF Full Text Request
Related items