Construction And Application Of Protein Domain Knowledge Map Based On Neo4j Graph Database

Posted on: 2023-08-25
Degree: Master
Type: Thesis
Country: China
Candidate: K P Xu
Full Text: PDF
GTID: 2530307022999109
Subject: Software engineering
Abstract/Summary:
Protein domains are the basic units of proteins. Over the past two decades, functional prediction work and the continuous establishment and improvement of databases have provided biomedical researchers with a great deal of valuable information, which is significant for drug design, antibody therapy, and disease prediction. However, because biological data are complex and diverse, and because the concepts and connections between domains are themselves intricate, it is difficult to understand the relationships between domains, or to study them in depth, using only the existing research literature and relational databases. To address this problem, we propose integrating the protein domain data held in the public bioinformatics databases Pfam and UniProt and modeling that knowledge according to the relationships between domains, so that those relationships can be studied and analyzed. The Neo4j database is accessed through the py2neo component to build a knowledge graph of protein domains. On top of this knowledge graph, an information platform is built that offers researchers basic queries over the entities and relationships of protein domains, together with three complex analysis queries: the five types of domain situations classified according to how domains combine, the common and specific combinations of domains, and the important domain nodes in the domain relationship network. The knowledge graph consists of 500,000 entities and 2 million relationships. It represents relationships between domains, domain information from the public databases, and the complex relationships between domains and proteins, filling a gap in knowledge graphs for this field. The information platform presents results to the user more clearly through entity relationship diagrams. It also addresses the inability of existing relational databases to retrieve relationships among multiple domains, and supports the analysis and querying of the domain combinations that biological researchers care about.
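
The abstract does not give the graph schema or the queries, so the following is only a minimal sketch of the general idea, under stated assumptions. The node labels `Protein` and `Domain`, the relationship type `HAS_DOMAIN`, the property names, and the toy protein-to-domain table are all hypothetical. Ranking domains by their number of distinct co-occurring partners is one simple stand-in for "important domain nodes", not necessarily the measure used in the thesis. The code needs no running Neo4j instance: it only prepares parameterized Cypher `MERGE` statements and analyzes domain combinations in memory.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: protein accession -> Pfam domain IDs found in it.
# These architectures are invented for illustration only.
PROTEIN_DOMAINS = {
    "P1": ["PF00001", "PF00002"],
    "P2": ["PF00001", "PF00002"],
    "P3": ["PF00001"],
    "P4": ["PF00003", "PF00001"],
}


def merge_statements(protein_domains):
    """Build (cypher, params) pairs for Protein-[:HAS_DOMAIN]->Domain edges.

    The labels, relationship type, and property names are assumptions.
    """
    query = (
        "MERGE (p:Protein {accession: $accession}) "
        "MERGE (d:Domain {pfam_id: $pfam_id}) "
        "MERGE (p)-[:HAS_DOMAIN]->(d)"
    )
    return [
        (query, {"accession": accession, "pfam_id": domain})
        for accession, domains in protein_domains.items()
        for domain in sorted(set(domains))
    ]


def cooccurrence(protein_domains):
    """Count how many proteins contain each unordered pair of distinct domains."""
    pairs = Counter()
    for domains in protein_domains.values():
        for a, b in combinations(sorted(set(domains)), 2):
            pairs[(a, b)] += 1
    return pairs


def degree_ranking(pairs):
    """Rank domains by their number of distinct co-occurring partner domains."""
    partners = {}
    for a, b in pairs:
        partners.setdefault(a, set()).add(b)
        partners.setdefault(b, set()).add(a)
    ranked = [(domain, len(others)) for domain, others in partners.items()]
    # Highest degree first; ties broken alphabetically for a stable order.
    return sorted(ranked, key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    for query, params in merge_statements(PROTEIN_DOMAINS):
        print(params)
    print(cooccurrence(PROTEIN_DOMAINS).most_common())
    print(degree_ranking(cooccurrence(PROTEIN_DOMAINS)))
```

With a live database, each `(query, params)` pair could be handed to a driver call such as py2neo's `Graph.run(query, params)`. Using `MERGE` rather than `CREATE` keeps repeated loads from duplicating nodes, which matters when the same domain appears in many proteins.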
Keywords/Search Tags: Graph database, Knowledge graph, Protein domain