| Search engines and desktop document search in recent years has developed rapidly. But massive contents search, personalized services are still passable. They simply find files and make a brief process. Knowledge in users' documents are not mined efficiently.The work of this thesis is described as the following:(1) Basic concepts and history of search engine, the development and advantages of desktop (document) search engine are analyzed;(2) Some algorithms are proposed; A large amount of data and some testing tools are used to assure accuracy and high efficiency of models and algorithms;(3) More detailed work of design and implementation is done; Top-down approach is mainly used; from the perspective of system structure of each module and package design, the thesis has a clear structure;(4) A number of tools related to software testing are used to do functional testing, pressure testing and etc.; Performance analysis and optimization of the whole system is also done;(5) The follow-up work to be done is concisely analyzed and prospects are presented while the work accomplished in this thesis is summarized. |