
Research on Building a Bioinformatics Platform with Automatic Information Acquisition and on Sequence Alignment Algorithms

Posted on: 2007-03-23
Degree: Master
Type: Thesis
Country: China
Candidate: S Zhu
Full Text: PDF
GTID: 2208360212475481
Subject: Computer application technology
Abstract/Summary:
The 21st century is the century of the life sciences, and bioinformatics has developed at an unprecedented pace in recent years. Secondary bioinformation databases are one of the important research directions in bioinformatics. Because both the data and their applications are complex, there is so far no reasonably general framework model that meets the development needs of general secondary bioinformation database systems. Sequence alignment is the most important operation and the fundamental information-processing method in bioinformatics. Aligning the large number of accumulated nucleic acid and protein sequences is significant for discovering functional, structural, and evolutionary information in biological sequences.
This thesis first introduces the development of bioinformatics, its research, and its databases, and then describes several alignment methods, including dot matrix sequence comparison, the dynamic programming algorithm for global sequence alignment, the Smith-Waterman algorithm, the FASTA algorithm, and the BLAST algorithm. It also analyzes the purpose and significance of sequence alignment research.
The thesis implements a framework system for building a bioinformatics research platform using .NET, XML, and Web Services. The platform automatically obtains bioinformation from the Internet and builds a local secondary bioinformation database. The focus is on extracting and analyzing network database resources with the methods of the WebClient class in order to establish the local secondary database. Operations such as viewing and searching this local database are implemented with ASP.NET and ADO.NET. The sequence alignment algorithms are wrapped as Web Services so that they can be called by clients.
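As an illustration of the Smith-Waterman algorithm named among the alignment methods above, the following is a minimal Python sketch of local alignment scoring. It is not taken from the thesis: the function name `smith_waterman_score`, the scoring values (match +2, mismatch -1, gap -1), and the linear gap penalty are assumptions chosen for the example, and only the best score is returned, not the alignment itself.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-1):
    """Return the best local alignment score between strings a and b.

    Scoring values and the linear gap penalty are illustrative assumptions.
    """
    # Keep only the previous row of the dynamic programming matrix.
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

The zero floor in each cell is what makes the alignment local: a poorly matching prefix is discarded rather than carried forward, so the score reflects the best-matching pair of segments.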
The system uses .NET to obtain Web information automatically, stores the data downloaded from web sites in XML documents, and combines these with Web Services to help developers of local secondary databases quickly find the bioinformation they actually need among the vast quantity of information sources. Applied flexibly, this lets them concentrate more of their effort on bioinformation processing itself. The system offers Internet users a convenient and efficient public bioinformatics technology service platform through the Web.
Based on the ideas of encoding sequences in hexadecimal and searching for the optimal variable window size, this thesis also proposes SLAHAW, a BLAST-based algorithm for searching similar nucleotide sequences. The algorithm stores data in hexadecimal code and derives the optimal search window value from the degree of similarity between sequence segments. This improves speed and accuracy and saves storage space. An experimental environment was built and the corresponding algorithms were run. The experiments show that, when the sequences satisfy the similarity-degree condition, SLAHAW is a fast and effective algorithm for matching similar sequence segments.
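The abstract does not say how SLAHAW maps nucleotides to hexadecimal digits, so the following Python sketch shows only one plausible scheme, under stated assumptions: each base takes 2 bits (A=0, C=1, G=2, T=3), two bases are packed into one hex digit, and an odd-length sequence is padded with a trailing A while the caller keeps the true length. The names `encode_hex` and `decode_hex` are hypothetical.

```python
# Assumed 2-bit code per base; the thesis's actual mapping is not given.
BASE_TO_BITS = {"A": 0, "C": 1, "G": 2, "T": 3}
BITS_TO_BASE = "ACGT"
HEX_DIGITS = "0123456789ABCDEF"


def encode_hex(seq):
    """Pack a nucleotide string into hex digits, two bases per digit."""
    seq = seq.upper()
    if len(seq) % 2:
        seq += "A"  # padding base; the caller must remember the true length
    digits = []
    for i in range(0, len(seq), 2):
        value = BASE_TO_BITS[seq[i]] * 4 + BASE_TO_BITS[seq[i + 1]]
        digits.append(HEX_DIGITS[value])
    return "".join(digits)


def decode_hex(hex_str, length):
    """Unpack hex digits back into the first `length` bases."""
    bases = []
    for digit in hex_str:
        value = int(digit, 16)
        bases.append(BITS_TO_BASE[value >> 2])  # high two bits
        bases.append(BITS_TO_BASE[value & 3])   # low two bits
    return "".join(bases[:length])


if __name__ == "__main__":
    packed = encode_hex("ACGT")
    print(packed, decode_hex(packed, 4))
```

Under this scheme, a sequence stored as hex text takes half as many characters as the original, and two bases can be compared with a single digit comparison, which is consistent with the storage and speed benefits the abstract claims.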
Keywords/Search Tags: Bioinformatics, Sequence Alignment, Search Algorithm, Secondary Database