Research On Multiple Query Methods On Large Scale RDF Graph

Posted on:2022-10-01

Degree:Master

Type:Thesis

Country:China

Candidate:J J Ma

Full Text:PDF

GTID:2480306320475434

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

With the development of artificial intelligence and the increasing demand for knowledge,knowledge mapping emerges as the times require.Most knowledge maps use RDF to describe resources.SPARQL BGP query on RDF graph is an important technology to describe,mine and analyze all kinds of knowledge maps.SPARQL BGP query on RDF graph is equivalent to subgraph matching problem.Since subgraph matching problem has been proved to be NP problem,how to efficiently query SPARQL BGP on large-scale RDF graph is a challenging problem.In this paper,we design multiple query methods on large-scale RDF graph for semantic queries with different query constraints.Firstly,when executing SPARQL query constrained by predicate constant,we use the distributed storage of star like structure and tag tree index based on predicate constant in this paper to perform query graph preprocessing and star like structure matching process.By reducing the number of query iterations and index efficient pruning filtering effect,we can speed up the retrieval speed,and use E-MJOC algorithm based on star like structure The matching and join order of query subgraphs are determined to optimize the query,and the join cost is reduced by reducing the generation of intermediate results,so as to improve the query efficiency.Secondly,when SPARQL query with mixed constant constraints is executed,the node partition index and coding tree index based on Bloom filter are designed.When preprocessing query graph and matching star like structure,the calculation range is reduced according to the cut point information and node partition index in RDF data to avoid unnecessary calculation of irrelevant storage nodes,and then the coding based on Bloom filter is used The tree index can prune and filter effectively,reduce the number of candidates to be matched,and reduce the cost of matching.Finally,the MS-MJOC algorithm is used to generate the corresponding matching and connection query plan,and the intermediate results are connected based on the common points,which improves the overall query efficiency by reducing a large number of connection operations.Finally,different RDF datasets are used for experimental comparison.The experimental results show that the SPC storage and indexing method and the optimization method of the query itself can promote the semantic query constrained by predicate constant.On the other hand,our SMC performs better than SDec and S2 X in semantic query with mixed constant constraints.

Keywords/Search Tags:

RDF graph, multiple semantic query, subgraph matching, star like structure

PDF Full Text Request

Related items

1	Research Of Subgraph Query Method Based On Frequent Structure In Large-scale Dynamic Graph
2	Research On RDF Subgraph Matching Method In Distributed Environment
3	Studies On Query Processing Techniques Over Uncertain Graph Data
4	Design And Application Of Adaptive Subgraph Matching Algorithm Based On Data Set Characteristics
5	Research Of Subgraph Query On Social Network
6	A Graph Matching Algorithm Based On The Minimal Connected Dominating Subgraph
7	Top-K Interesting Subgraph Matching Approach In The Heterogeneous Graph
8	Research On Efficient Subgraph Matching Algorithm
9	Research On Subgraph Matching Algorithm Based On Lagre-Scale Graph
10	Research On Top-K Interesting Subgraph Query For Large-Scale Dynamic Labeled Graphs