Font Size: a A A

Gene Sequence Alignment Algorithm Research And Implement In SNP

Posted on:2012-12-21Degree:DoctorType:Dissertation
Country:ChinaCandidate:X J KangFull Text:PDF
GTID:1110330344952596Subject:Resources and Environmental Information Engineering
Abstract/Summary:PDF Full Text Request
In recent years, life science research is in developing by leaps and bounds. As the human genome project completed and modem biological technology rapid development, Lots of biological information acquisition has to uncover the mystery of life and provides solid data base. In the time of life science research into the Post-Genome Era, Life science research focus is no longer biological information, but moved to the research of genome function and the changing laws. Therefore the pressing needs have been produced of mass data processing. Meanwhile, computer technology and network technology has a revolutionary development to the massive data processing and provides powerful support, and ultimately have vast pushing effect for human crack the genetic code, grasps the disease of the genetic information, cracked gene function, structure and function prediction.SNPS namely single-nucleotide polymorphisms, It refers to species in the evolutionary process because of the variation in the genome of nucleotides resulting differences between the DNA sequence. It mainly includes bases loss, insert, conversion etc, SNP reflects the difference of genetic information contained in the site is causing some genetic diseases, cancer and other important factors. Gene mutation and SNPS in biological systems, bioinformatics and biomedical research plays a very important role.The expression form of Biological information data is genetic sequence data. Through the comparison of sequence can found the information of the function and structure. Gene double sequence alignment or multi-sequence alignment analysis is one of research hotspot of bioinformatics. For the analysis of the gene sequence is usually adopts clustering algorithms. This paper mainly studies based on sequence alignment algorithm in gene expression data to SNP problem analysis. The main work and innovation points as follows.1) This paper firstly introduces bioinformatics the related concepts and their important sense, and summarized current research status in domestic and abroad.2) For a detailed study on gene expression data commonly used the cluster analysis algorithm, through the experiment we analyzed the algorithms.3) Introduced the research status of Gene sequence alignment algorithm, provides the basis for this paper. 4) Based on the research of sequence alignment algorithm, this paper puts forward in mass gene sequences in the experiment for SNPS data plan design, through the improvement of classic BLAST algorithm we design and realization the algorithm in PC platform and high performance cluster environment. Furthermore we make a detailed analysis and testing through the experimental. Experiments show that this experiment scheme in time complexity and results are obtained in the ideal result.5) As the bases of the algorithm, we design and realized the sequence analysis system based on Windows operating system and cluster platform. Its main functions include gene sequences of derivation, SNPS data analysis, sequence alignment, parameter setting and results data output, the shading treatment check, etc.
Keywords/Search Tags:Gene expression data, Genetic clustering, Sequence alignment, SNP
PDF Full Text Request
Related items