Research On Visualization Of Gene Sequence Alignment Based On Variant Theory

Posted on: 2021-07-17
Degree: Master
Type: Thesis
Country: China
Candidate: X Zhang
Full Text: PDF
GTID: 2480306230478294
Subject: Software engineering
Abstract/Summary:
A random sequence is a sequence of random variables, and it holds an important place in both probability theory and statistics. Biological nucleic acid sequences, as naturally occurring random sequences, have received extensive attention. As the carrier of genetic information, the nucleic acid sequence plays a vital role in the origin and development of organisms. In recent years, with continued progress in natural science and sequencing technology, the volume of sequence data has grown rapidly, and how to analyze and process these data has become a focus of research. The most basic and most important step is sequence alignment. Researchers outside China have studied sequence alignment algorithms extensively and have released many related tools on that basis, whereas research in China is still at an early stage and deserves more attention.

Current sequence alignment algorithms fall mainly into two groups: algorithms based on dynamic programming and heuristic algorithms. Their core idea is to compare sequences pairwise and use a scoring scheme to find the optimal alignment of two sequences, which then supports homology analysis and other studies. The most representative and most widely used are the Needleman-Wunsch algorithm and the Smith-Waterman algorithm. The Needleman-Wunsch algorithm is a global alignment algorithm, and the Smith-Waterman algorithm is a local alignment algorithm developed from it. Both are highly sensitive, but they are slow and memory-intensive, because each fills a dynamic programming matrix whose size is the product of the two sequence lengths.

Building on variant theory, this thesis introduces statistical methods and proposes a visualization method suited to sequence alignment. In this method, the sequences to be compared are divided into equal-length subsequence segments of a chosen length, and the counts of the different bases in each segment are combined statistically to obtain a separation curve between the sequences, from which their similarity is determined. The resulting visualizations show distinct patterns and clear curves, are intuitive and easy to analyze, and are efficient in both time and space. While maintaining accuracy, the method reduces the time and memory overhead of sequence comparison and greatly improves efficiency, making a positive contribution to sequence comparison.
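The segment-and-count step described above can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the abstract does not define how the per-segment counts are combined, so the sketch assumes that the counts of A, C, G and T are compared by an L1 (absolute difference) distance for each segment. The names `segment_counts` and `separation_curve` are hypothetical.

```python
from collections import Counter

BASES = "ACGT"  # assumption: DNA alphabet; other symbols are ignored


def segment_counts(seq, seg_len):
    """Count each base in consecutive, non-overlapping segments of seg_len.

    A trailing segment shorter than seg_len is dropped so that all
    segments have equal length, as the abstract requires.
    """
    seq = seq.upper()
    n_full = len(seq) // seg_len
    counts = []
    for i in range(n_full):
        segment = seq[i * seg_len:(i + 1) * seg_len]
        tally = Counter(segment)
        counts.append(tuple(tally[base] for base in BASES))
    return counts


def separation_curve(seq_a, seq_b, seg_len):
    """Per-segment L1 distance between the base-count vectors of two sequences.

    Assumption: this distance stands in for the thesis's "separation curve";
    the actual statistic is not given in the abstract.
    """
    counts_a = segment_counts(seq_a, seg_len)
    counts_b = segment_counts(seq_b, seg_len)
    return [
        sum(abs(x - y) for x, y in zip(ca, cb))
        for ca, cb in zip(counts_a, counts_b)
    ]


if __name__ == "__main__":
    # First segment matches in composition, second differs completely.
    print(separation_curve("ACGTAAAA", "ACGTCCCC", 4))
```

Under these assumptions each sequence is scanned once and only four counts per segment are stored, which is consistent with the low time and memory cost the abstract claims. Note that a zero value means two segments share the same base composition, not necessarily the same base order.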
Keywords/Search Tags: Sequence alignment, variant theory, visualization, separation curve