Research And Design On Data Mining In Intelligent Question Answering System

Posted on: 2011-05-01
Degree: Master
Type: Thesis
Country: China
Candidate: D D Yan
Full Text: PDF
GTID: 2198360305971597
Subject: Computer application technology
Abstract/Summary:
Data mining is the process of extracting previously unknown, potentially useful information and knowledge from large volumes of incomplete, noisy, fuzzy and random data. Through statistics, analysis, synthesis and reasoning over the data, it uncovers correlations within the data, future trends and general knowledge, which can be used to guide high-level business decisions. Question answering is not only part of classroom teaching but also an important module of a web-based course. A QA system aims to make full use of the knowledge-base resources on the educational network to answer questions and to improve the quality and efficiency of teaching.
This thesis analyzes existing question-answering systems, then studies and implements a web-based intelligent question-answering system with BBS functions. The system implements user registration, login, and question and answer; through the BBS it supports asynchronous clarification between users and gives them a wider platform for discussion. The thesis applies data mining algorithms to the QA system, proposes a scheme for a question-answering system based on those algorithms, and implements it. The aim of the scheme is to overcome some defects of current question-answering systems and obtain an efficient QA system.
The general idea is as follows. An improved keyword-based association rules algorithm is applied to calculate the correlation value between words, from which the similarity between questions is obtained. The best answer is the one with the maximum similarity value, which yields one-to-one QA pairs. Text clustering is then performed on the QA pairs, and the questions are stored by class. Applying the association rules algorithm within each class after clustering produces a more accurate association table for extracting better answers from the database, and improves the similarity measure. In this way a comprehensive and accurate QA database is formed that can itself be used for data mining. Finally, the similarity based on word association values is used to answer the questions that users ask, giving an intelligent QA system.
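The abstract does not give the details of the improved association rules algorithm, so the following is only a minimal sketch of the general idea under stated assumptions: keyword association is approximated by plain co-occurrence confidence over the stored questions, question similarity is the averaged best association between keywords, and the answer with the highest similarity is returned. The names `tokenize`, `association_table`, `similarity` and `best_answer`, the stop-word list, and the sample QA pairs are all hypothetical and not taken from the thesis; the per-class clustering step is omitted.

```python
from collections import Counter
from itertools import combinations

# Hypothetical stop-word list; a real system would use a proper keyword extractor.
STOP_WORDS = {"what", "is", "a", "an", "the", "how", "do", "i", "of", "in", "to"}


def tokenize(text):
    """Return the set of keywords in a question (assumption: whitespace split)."""
    words = text.lower().replace("?", " ").split()
    return {w for w in words if w not in STOP_WORDS}


def association_table(questions):
    """Map (a, b) to confidence(a => b) = count(a and b) / count(a)."""
    single = Counter()
    pair = Counter()
    for question in questions:
        keywords = tokenize(question)
        single.update(keywords)
        for a, b in combinations(sorted(keywords), 2):
            pair[(a, b)] += 1
            pair[(b, a)] += 1
    return {(a, b): n / single[a] for (a, b), n in pair.items()}


def word_association(a, b, table):
    """Association between two words; identical words count as 1.0."""
    if a == b:
        return 1.0
    return max(table.get((a, b), 0.0), table.get((b, a), 0.0))


def similarity(q1, q2, table):
    """Symmetric similarity: average of each keyword's best association."""
    k1, k2 = tokenize(q1), tokenize(q2)
    if not k1 or not k2:
        return 0.0

    def directed(xs, ys):
        return sum(max(word_association(x, y, table) for y in ys) for x in xs) / len(xs)

    return (directed(k1, k2) + directed(k2, k1)) / 2


def best_answer(query, qa_pairs, table):
    """Return the (question, answer) pair whose question is most similar to the query."""
    return max(qa_pairs, key=lambda qa: similarity(query, qa[0], table))


# Illustrative data only.
qa_pairs = [
    ("What is data mining?", "A1"),
    ("How do I cluster text documents?", "A2"),
    ("What is an association rule?", "A3"),
]
table = association_table(q for q, _ in qa_pairs)
question, answer = best_answer("data mining definition", qa_pairs, table)
```

In the scheme described above, a table like this would be rebuilt separately inside each cluster of QA pairs, so that word associations reflect only the questions of one class; that step would sit between `association_table` and `best_answer` and is left out here.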
Keywords/Search Tags: data warehouse, data mining, association rules, correlation analysis, text clustering