Font Size: a A A

Research On Full Text Retrieval System Of Educational Item Bank Based On Lucene

Posted on:2021-01-01Degree:MasterType:Thesis
Country:ChinaCandidate:Z K QinFull Text:PDF
GTID:2427330605966988Subject:Engineering
Abstract/Summary:PDF Full Text Request
At present,many educational question bank websites already have the ability to search based on the document keywords in the database,but lack the ability to search the full text based on the content.The full-text search is based on the inverted index in the storage form of "word-document matrix",which can quickly obtain the list of documents containing this word according to the word,and improve the retrieval efficiency and accuracy.Based on the Lucene full-text search engine,the thesis develops a full-text search system for the educational question bank.Taking the high school math question bank as an example,the data in the educational question bank can be quickly,accurately and effectively retrieved.The main work contents are as follows:(1)The main research content of this article is full-text retrieval technology,and the main target is the formula of the similarity value;research on the image text extraction technology,the principle and recognition of the open source optical recognition component Tesseract-OCR Process and interface methods were analyzed.(2)This paper makes a detailed investigation and Analysis on the requirements of the full-text retrieval system of education question bank,including the analysis of business requirements,user characteristics and system function requirements.The requirement is transformed into flow chart,use case diagram and function module diagram by UML language.(3)A full-text search system for educational question banks is implemented,including document entry management,document content extraction,index construction,and document retrieval modules.The collected question database corpus is stored through the file system,and the inverted index is established through Lucene.The retrieval module implements document sorting and keyword highlighting.The entire system is built using the MVC framework,which divides the system's data,display,and business functions into different modules,which makes the logic level in the development process clearer and more convenient in later maintenance.The full-text retrieval system of educational question bank helps teachers and students quickly,accurately and effectively retrieve the required content,and promotes the role of educational question bank in student learning and teacher teaching.It has certain reference significance for departments and enterprises with similar needs.
Keywords/Search Tags:full-text retrieval, Lucene, Tesseract-OCR, question bank
PDF Full Text Request
Related items