Oil Domain Ontology Construction Based On Document Semantic Recognition

Posted on:2019-01-23

Degree:Master

Type:Thesis

Country:China

Candidate:P H Zhu

Full Text:PDF

GTID:2381330626956575

Subject:Computer technology

Abstract/Summary:

With the continuous development of information technology, activities in the petroleum field are becoming increasingly informatized, and petroleum information systems draw on various knowledge bases more and more frequently. There are many specialized professionals in the petroleum field, new technologies and terminology are continuously updated, and the information is not structured. These problems affect knowledge representation, information sharing, software reuse and efficient management in the petroleum field. Ontology is the most classical and most widely used method of knowledge representation: the corresponding text files are obtained from existing information sources, and the ontology of the relevant field is built manually or semi-automatically. At present, the petroleum field suffers from many problems, such as independently developed systems, non-uniform data coding rules and repeated development of system software. In view of these problems, this paper proposes a method for constructing a petroleum domain ontology based on semantic recognition of documents, which is divided into the following parts.

First, document word segmentation is the most important task in constructing a petroleum domain ontology. Documents in the petroleum field are characterized by specialized terminology and compound words. Based on the hidden Markov model (HMM), this paper proposes an adaptive HMM character segmentation model that combines a domain-knowledge dictionary with user-defined information by introducing a terminology set. The proposed algorithm calibrates character segmentation under semantic constraints and character-meaning constraints, and can accurately identify professional terms and character combinations in the petroleum field.

Second, domain corpora of different scales are built as information sources for concept extraction. After analyzing the statistical method based on TF-IDF and the method based on a petroleum dictionary, we design a method that combines the two to perform concept extraction over different numbers of documents. The combined method is shown to be more accurate in concept extraction.

Third, the semantic relations between the extracted petroleum concepts are identified. Each concept is expressed as a word vector according to the Continuous Bag-of-Words (CBOW) model. The word vector is extended and strengthened with an improved vector training algorithm so that it carries contextual semantic information. The resulting word vectors are fed into a Support Vector Machine (SVM) to train an SVM classifier, which then identifies hyponymy, part-whole and synonymy relations.

Finally, the ontology is constructed automatically from the extracted concepts and the relations between them. Existing ontology learning tools are analyzed to build the ontology learning system of this paper, and the automatic derivation of a Chinese ontology is realized using a probabilistic ontology model and a data-driven method. This paper mainly uses the OWL language: the exported OWL file is imported into the Protégé platform, where further feedback correction is made to finally realize a visual representation of the ontology.
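The concept extraction step described in the abstract, which combines a TF-IDF statistic with a petroleum dictionary, can be illustrated with a minimal sketch. The abstract does not state how the two signals are combined, so the rule used here (multiplying the TF-IDF score of dictionary terms by a fixed boost) is an assumption made only for illustration, as are the names tf_idf_scores, extract_concepts, top_k and dictionary_boost. Documents are assumed to be already segmented into tokens.

```python
import math
from collections import Counter


def tf_idf_scores(docs):
    """Return {term: highest TF-IDF value of the term over all documents}.

    docs is a list of token lists (documents already segmented into words).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    scores = {}
    for doc in docs:
        counts = Counter(doc)
        for term, count in counts.items():
            tf = count / len(doc)
            idf = math.log(n_docs / df[term])
            scores[term] = max(scores.get(term, 0.0), tf * idf)
    return scores


def extract_concepts(docs, domain_dictionary, top_k=3, dictionary_boost=2.0):
    """Rank candidate concepts by TF-IDF, boosting terms found in the dictionary.

    The multiplicative boost is a hypothetical combination rule, not the
    thesis's actual method. Ties are broken alphabetically.
    """
    scores = tf_idf_scores(docs)
    combined = {
        term: score * (dictionary_boost if term in domain_dictionary else 1.0)
        for term, score in scores.items()
    }
    ranked = sorted(combined, key=lambda term: (-combined[term], term))
    return ranked[:top_k]
```

With a boost of 1.0 the function reduces to plain TF-IDF ranking, which makes it easy to compare the statistical method alone against the combined method on the same corpus.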

Keywords/Search Tags:

Domain ontology, concept extraction, semantic relation identification, word vector

Related items

1	Automatic Construction Of Ontology Based On Document Retrieval And Semantics Identification In The Oil Field
2	Research On Coal Mine Domain Ontology Concept Update And Uncertainty Reasoning Method
3	Semantic Association Retrieval Based On Oil Domain Ontology
4	Research Of Relation Extraction Method Of Aviation Emergency Text
5	Research About Concept Extraction And Mapping Of Water Environment Ontology Based On Artificial Neural Network Algorithm
6	Research On The Ontology Storage Model And Expansion Mechanism In Oil Field
7	Research On Semantic Query And Reasoning Methods For Aviation Security Events
8	Construction Of Intelligent Clothing Pattern Model Based On Domain Ontology
9	Research On Relativity Of Injection Mould Design Standards Based On Semantic Ontology
10	Research On Sedimentary Facies Of Semantic Classification And Knowledge Database Based On Ontology