
Construction Of Domain Ontology In Chinese

Posted on: 2013-04-02
Degree: Master
Type: Thesis
Country: China
Candidate: X Wang
Full Text: PDF
GTID: 2248330392957838
Subject: Computer software and theory
Abstract/Summary:
The utility of domain ontologies is now widely acknowledged in an increasing number of domains, and ontologies play an increasingly important role in knowledge management and the Semantic Web. However, several barriers must be overcome before ontologies become practical and useful tools. A critical issue is the task of identifying and defining concepts. For large and complex application domains this task can be lengthy, costly, and controversial, since different people may have different points of view about the same concept. To reduce the cost, several automated or semi-automated construction approaches have been proposed. They use machine learning, statistical methods, or natural language processing techniques to extract concepts and relations from existing data sources, so that a domain ontology can be constructed automatically or semi-automatically.

This paper presents an Arisa-based domain ontology construction mechanism that semi-automatically extracts a domain ontology from a thesaurus and domain-related text documents. It discusses the key technologies of ontology construction, namely domain concept extraction and relation extraction, covering both taxonomic and non-taxonomic relations between concepts. The core concepts are obtained or converted from a related thesaurus. Taxonomic relations are extracted with an improved hierarchical concept clustering approach that refers to the related thesaurus. For non-taxonomic relations, we use a pattern-matching approach and association rules to extract relations from unstructured text documents. Support and confidence are used to find related items that co-occur in documents that have been processed by ICTCLAS, including tokenization and lexical analysis. In addition, relations are extracted more accurately with predefined templates that take Chinese syntax into account.

Finally, the Jena API is used to formalize the concepts and relations and generate the OWL ontology, and Protégé is used to visualize and manage the OWL ontology.
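
The support and confidence step for non-taxonomic relations can be illustrated with a minimal Python sketch. It is not the thesis's implementation, which the abstract does not give. It assumes each document has already been segmented (for example by ICTCLAS) and reduced to a list of concept terms; the function name `mine_pair_rules`, the thresholds, and the sample terms are hypothetical.

```python
from collections import Counter
from itertools import combinations


def mine_pair_rules(docs, min_support=0.5, min_confidence=0.7):
    """Return rules (x, y, support, confidence) for co-occurring term pairs.

    docs: list of term lists, one per document (assumed already segmented).
    support(x, y)   = fraction of documents containing both x and y
    confidence(x→y) = documents containing both / documents containing x
    """
    n = len(docs)
    term_count = Counter()
    pair_count = Counter()
    for terms in docs:
        unique = set(terms)  # count each term once per document
        term_count.update(unique)
        pair_count.update(
            frozenset(pair) for pair in combinations(sorted(unique), 2)
        )

    rules = []
    for pair, count in pair_count.items():
        support = count / n
        if support < min_support:
            continue
        a, b = sorted(pair)
        # Confidence is directional, so test both x→y and y→x.
        for x, y in ((a, b), (b, a)):
            confidence = count / term_count[x]
            if confidence >= min_confidence:
                rules.append((x, y, support, confidence))
    return sorted(rules)


# Hypothetical, already-segmented documents.
docs = [
    ["ontology", "concept", "relation"],
    ["ontology", "concept"],
    ["ontology", "thesaurus"],
    ["concept", "relation"],
]
rules = mine_pair_rules(docs, min_support=0.5, min_confidence=0.7)
for x, y, support, confidence in rules:
    print(f"{x} -> {y}: support={support:.2f}, confidence={confidence:.2f}")
```

A rule that passes both thresholds only signals that two concepts are related; naming the relation would still rely on a separate step such as the template matching described above.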
Keywords/Search Tags: Ontology construction, Relationship extraction, Hierarchical clustering, Pattern-matching approach, Association rules