An Ontology-based Domain Information Collection And Its Application

Posted on: 2017-10-06
Degree: Master
Type: Thesis
Country: China
Candidate: Y Li
Full Text: PDF
GTID: 2348330503965721
Subject: Engineering
Abstract/Summary:
With the rapid development of the Internet over the last twenty years, the Internet has become an important way of acquiring knowledge. Most of that knowledge is presented to people as web pages, which makes it very convenient to obtain. Getting accurate and useful knowledge out of the vast amount of information on the Internet requires Web information extraction technology, which extracts information from web pages according to certain rules. Note that Web extraction operates not on plain text but on semi-structured Web data.

Ontology, originally a philosophical concept, is used to describe knowledge in a field. Its basic definition is "a collection of domain-specific concepts and the relationships among them", which makes it well suited as a semantic model of domain knowledge. Ontology applications depend on ontology construction, which has attracted extensive academic research in recent years. Construction requires the participation of domain experts, who must organize a large number of concepts, facts, and instances in the field, and this costs a great deal of time and effort.

Information resources such as Wikipedia are now very rich on the Web. This thesis combines Internet information extraction with ontology and proposes an ontology-based method for collecting domain information. With this method, a large number of instances in a domain, together with the relations among them, can be stored in an ontology semi-automatically, which makes ontology construction easier and achieves both the acquisition and the storage of domain information.

We then apply the method to the film domain. We collected film-related knowledge and constructed an ontology with Protege, obtained data from the Douban movie website, integrated the data into the ontology through the Jena API, and used Jess to define reasoning rules that enrich the ontology's knowledge set. Finally, we built a film information display system on top of the resulting Movie Ontology. Storing film domain knowledge in the film ontology effectively integrates all homologous movie data, whereas the Douban website offers only a presentation of the data. We define knowledge expansion rules for the film domain and use them to uncover internal links within the domain, so that the data is stored and displayed with its relations rather than mechanically saved in a database.
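The pipeline described above (load extracted records into an ontology, then apply expansion rules to derive new links) can be illustrated with a minimal sketch. This is not the thesis's implementation: it uses a plain Python set of triples in place of Jena and a hand-written loop in place of Jess, and the property names (`directedBy`, `hasActor`, `workedWith`), the helper functions, and the sample record are all hypothetical.

```python
from typing import Iterable, Set, Tuple

# A triple is (subject, predicate, object), as in RDF-style ontology storage.
Triple = Tuple[str, str, str]


def add_movie(store: Set[Triple], title: str, director: str,
              actors: Iterable[str]) -> None:
    """Store one extracted movie record as instance triples (hypothetical schema)."""
    store.add((title, "rdf:type", "Movie"))
    store.add((title, "directedBy", director))
    store.add((director, "rdf:type", "Director"))
    for actor in actors:
        store.add((title, "hasActor", actor))
        store.add((actor, "rdf:type", "Actor"))


def infer_collaborations(store: Set[Triple]) -> Set[Triple]:
    """Apply one expansion rule until no new triples appear.

    Rule (an assumption for illustration):
        (M directedBy D) and (M hasActor A)  =>  (A workedWith D)
    """
    changed = True
    while changed:
        new_triples: Set[Triple] = set()
        for movie, pred, director in store:
            if pred != "directedBy":
                continue
            for movie2, pred2, actor in store:
                if movie2 == movie and pred2 == "hasActor":
                    derived = (actor, "workedWith", director)
                    if derived not in store:
                        new_triples.add(derived)
        store |= new_triples          # add derived facts after the scan
        changed = bool(new_triples)   # stop once a pass derives nothing new
    return store


if __name__ == "__main__":
    kb: Set[Triple] = set()
    # Placeholder record; real data would come from the extraction step.
    add_movie(kb, "ExampleFilm", "DirectorX", ["ActorA", "ActorB"])
    infer_collaborations(kb)
    for triple in sorted(kb):
        print(triple)
```

The derived `workedWith` triples are the kind of internal link that a flat database row would not record explicitly; in the thesis's setting, the storage role would be played by Jena and the rule by Jess.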
Keywords/Search Tags: domain ontology, Web page information extraction, ontology storage