Font Size: a A A

Research And Application Of Multilingual Password Corpus Method

Posted on:2022-10-07Degree:MasterType:Thesis
Country:ChinaCandidate:Z Y WuFull Text:PDF
GTID:2518306569981559Subject:Software engineering
Abstract/Summary:PDF Full Text Request
Passwords are the main identity authentication and information encryption method on the Internet today,and it plays a very important role in protecting the privacy of users.This article starts with the regularity of user password vocabulary,and focuses on the characteristics of password vocabulary and the construction method of password corpus.We use hot word and structural password guessing methods to increase the password guessing hit rate by expanding the password corpus set.It has important application value for password guessing and password evaluation.This article first analyzes the characteristics of the real password set.From the perspectives of password length distribution,character structure types,vocabulary types,etc.,the similar laws and different characteristics of password sets in different regions are explored.Then compare several common corpus expansion methods on the corpus expansion method.Based on lexical characteristics,we collected and organized a large number of vocabulary materials through a variety of ways.At the same time,we also completed the Latinization of vocabulary in 8 languages,including Russian,Japanese,and Chinese,as a preliminary preparation for the construction of the corpus.Secondly,we designed the core steps of the six corpus construction methods of cleaning,latinization,fixed length,expansion,conflict resolution,and segmentation,introduced the impact of different processes on the corpus,and developed a set of automated corpus construction tools to achieve the above Operation process.Finally,based on the above process,we constructed a million-level vocabulary level,including English,Chinese,Japanese,Russian and other multilingual password corpus.The corpus files are stored in accordance with the vocabulary language,category,length,and deformation.In order to verify the improvement effect of the multilingual password corpus on password cracking,a comparative experiment was carried out by comparing the Japanese corpus without the Japanese corpus and the Japanese corpus.The results show that the hit rate of the Japanese corpus included in the number of guesses of 1010~1014is higher than that of the Japanese corpus not included,and the improvement rate of the hit rate in the guess result is as high as 10%,which shows that the password corpus is of great help in the guessing process.
Keywords/Search Tags:password feature analysis, password guessing, target rule method, deformed corpus product rule
PDF Full Text Request
Related items