Font Size: a A A

Research And Application On Technology Of Integrating Bioinformatics Resources With Optimization And Parallel Processing

Posted on:2009-07-06Degree:MasterType:Thesis
Country:ChinaCandidate:X LiFull Text:PDF
GTID:2120360242967473Subject:Computer software and theory
Abstract/Summary:PDF Full Text Request
Stepping into the 21st Century, bioinformatics as a new intersecting subject is taken into account by researchers. Bioinformatics is a subject, which is using computer theories and application knowledge to analyze vast biology data and do data mining, so that people could discover, analyze and predict life phenomenon. With the development of Human Genome Project and 1000$Human Genome Project Plan, multiple genome data are created for every field of researches' needs. Although different data are describing different life phenomenon, researches are attempting to find the relationship among the date.As developing of computer technologies, more and more valid methods for analyzing and processing plenty of biology data are available. For instance, Web Service has been an implementation on accessing remote systems and collecting the data resource. Multi-Agent technology has been applied on searching distributed databases, accessing and collecting data using hierarchy. For distributed database queries, there exist traditional optimization technology and modern inverted list indexing. For processing vast data, there is useful Map reduce merge parallel process model. Moreover, unifying the data format is applied on the transferring the great of data. Overall, these methods are promoting researches on bioinformatics and making integration on abundant biology resources based on Web realized, so that researchers are developing biology integration systems which are not only for sharing, comparing and analyzing, but also for mining data.The thesis presents a set of optimization solving strategies for the researches on integration technologies of distributed databases on Internet. That includes such applications of query optimization when integrating biology databases, unifying data formats, Map reduce merge parallel processing model and multi-Agent modules. This paper has the sections based on the theories and implements as follows. Firstly, the paper introduces the research background. Secondly, it also introduces the basic related theories on the research. Thirdly, it describes the system architecture, process workflow and module functions based on these theories, and then puts emphasis on the parallel data processing model. Finally, the paper states the system implement and technology difficulties.
Keywords/Search Tags:Data Integration, Optimization Processing, Map reduce merge Model
PDF Full Text Request
Related items