
Study On Heterogeneous Bioinformatics Database Integration

Posted on: 2011-08-28  Degree: Master  Type: Thesis
Country: China  Candidate: J Ma  Full Text: PDF
GTID: 2248330374495175  Subject: Computer application technology
Abstract/Summary:
Biological research has entered the post-genome era. Worldwide efforts such as the Human Genome Project (HGP) have produced large volumes of biological data (DNA, RNA, protein data, and so on) that must be analyzed and processed, and handling these data with computers has become an important topic in the application of computing to biological research. Because the data come from many different sources, however, one problem that must be solved in processing and analyzing them is how to query the information that a given study needs.

To address the data heterogeneity that arises in organizing and processing biological data, this thesis constructs an XML-based bio-data integration system together with the common data model it uses. The system shields users from the heterogeneity of biological information data, offers them a unified application platform, and provides a basis for building secondary biological databases.

As a meta-language, XML has become the de facto standard for representing and exchanging data on the Internet. It is a structured, descriptive language with strong data-description capabilities: it uses a tree-shaped storage structure and allows deeply nested content. These characteristics make it well suited to describing biological data with complex structures in a uniform way, which simplifies data exchange and enables data sharing.

The thesis introduces the XML language and the characteristics, storage, and classification of biological data; analyzes several commonly used data integration methods; compares approaches to converting between XML and relational databases; and presents a method for resolving heterogeneity in biological data. It then designs the XML-based bio-data integration system, covering the system design, the user interface design, and the key technologies of the system.
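As a hedged illustration of the conversion between relational data and XML mentioned above, the following Python sketch serializes the rows of a relational table as an XML tree and reads them back as records. It is not the thesis's implementation: the table name `sequence`, its columns, the sample row, and the helper names `rows_to_xml` and `xml_to_rows` are all hypothetical, and the sketch assumes that column names are valid XML element names.

```python
import sqlite3
import xml.etree.ElementTree as ET


def rows_to_xml(conn, table, root_tag="records", row_tag="record"):
    """Serialize every row of `table` as XML, one child element per column.

    Assumes `table` is a trusted identifier and that each column name
    is also a valid XML element name.
    """
    cursor = conn.execute(f'SELECT * FROM "{table}"')
    columns = [desc[0] for desc in cursor.description]
    root = ET.Element(root_tag, {"source": table})
    for row in cursor:
        record = ET.SubElement(root, row_tag)
        for name, value in zip(columns, row):
            child = ET.SubElement(record, name)
            # SQL NULL is left as an empty element (text stays None).
            if value is not None:
                child.text = str(value)
    return root


def xml_to_rows(root, row_tag="record"):
    """Turn the XML tree back into a list of {column: text} dictionaries."""
    return [
        {child.tag: child.text for child in record}
        for record in root.findall(row_tag)
    ]


# Hypothetical sample data, used only to exercise the two helpers.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sequence (accession TEXT, organism TEXT, residues TEXT)")
conn.execute("INSERT INTO sequence VALUES ('SEQ001', 'Homo sapiens', 'ATGGCC')")

root = rows_to_xml(conn, "sequence")
xml_text = ET.tostring(root, encoding="unicode")
rows = xml_to_rows(root)
```

In this sketch every value comes back as text, so column types would have to be recorded separately (for example in a schema) if the reverse conversion needs to restore them.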
Finally, the thesis presents the XML-based bio-data integration system itself. The specific contents include an analysis of the syntactic and semantic heterogeneity problems in bio-data integration, and the use of XML to resolve syntactic heterogeneity: XML describes data powerfully, expresses both structured and semi-structured data easily, and is particularly suitable as an intermediate format for data integration. The thesis first briefly introduces biological data and its characteristics and analyzes the problems encountered in storing, describing, and integrating such data, and then proposes its main subject, the design of an integration system. The system mainly consists of heterogeneous web databases in the data source layer, data wrappers in the data query layer, a query decomposer in the data integration layer, an XML data wrapper, and a web server in the web service layer.
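The wrapper-based layering described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the system built in the thesis: the class `DictSourceWrapper`, the function `integrate`, the source names, the field names, and the sample records are hypothetical. In-memory dictionaries stand in for the heterogeneous web databases, and a plain fan-out to every wrapper stands in for the query decomposer.

```python
import xml.etree.ElementTree as ET


class DictSourceWrapper:
    """Hypothetical data wrapper for one source.

    It maps the common field names of a shared data model onto the
    source's own field names and returns matching records as XML.
    """

    def __init__(self, name, records, field_map):
        self.name = name
        self.records = records      # native records, here plain dicts
        self.field_map = field_map  # common field name -> native field name

    def query(self, field, value):
        native_field = self.field_map.get(field)
        if native_field is None:
            return []  # this source does not hold the requested field
        hits = []
        for record in self.records:
            if record.get(native_field) == value:
                entry = ET.Element("entry", {"source": self.name})
                for common, native in self.field_map.items():
                    ET.SubElement(entry, common).text = str(record.get(native, ""))
                hits.append(entry)
        return hits


def integrate(wrappers, field, value):
    """Send one query to every wrapper and merge the XML results."""
    result = ET.Element("result")
    for wrapper in wrappers:
        result.extend(wrapper.query(field, value))
    return result


# Two hypothetical sources that name the same fields differently.
source_a = DictSourceWrapper(
    "source_a",
    [{"acc": "A1", "org": "Homo sapiens"}, {"acc": "A2", "org": "Mus musculus"}],
    {"accession": "acc", "organism": "org"},
)
source_b = DictSourceWrapper(
    "source_b",
    [{"accession_no": "B7", "species": "Homo sapiens"}],
    {"accession": "accession_no", "organism": "species"},
)

merged = integrate([source_a, source_b], "organism", "Homo sapiens")
entries = merged.findall("entry")
```

The per-source `field_map` is where differing field names are reconciled in this sketch, while emitting every hit as an `entry` element gives the caller a single XML format regardless of how each source stores its records.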
Keywords/Search Tags: Data heterogeneity, Integration of biological data, XML, Integrated system