Analysis of Coding and Non-coding Sequences Based on Z-curve Theory

Posted on: 2009-09-10
Degree: Master
Type: Thesis
Country: China
Candidate: Y L Yang
Full Text: PDF
GTID: 2120360242495071
Subject: Optics
Abstract/Summary:
Extracting useful information from the enormous volume of biological data is a primary task of the post-genome era. DNA serves as the language of heredity in non-coding sequences as well as in coding sequences. Non-coding sequences are abundant in eukaryotic cells; in the human genome, for example, they account for 95%-97% of the sequence. Many non-coding regions have extensive regulatory and control functions in life processes, but most of them have not yet been located.

This thesis studies the coding sequences of Avian Influenza Virus (AIV) proteins and several kinds of non-coding sequences with different functions, using the Z-curve method proposed by Zhang Chun-ting together with other informatics methods. Taking H5N1 sequences and four kinds of ncRNA sequences with different functions as the objects of study, the corresponding Z-curves of the selected sequences were drawn with the Z-plotter software, the proportion of each base and the G-C content of these sequences were calculated, and the results were analyzed statistically. These results can provide a basis for predicting the functions of ncRNA. The thesis then re-identifies the 2694 possible ORFs in the Aeropyrum pernix K1 genome using the AZ index in a clustering algorithm. According to the AZ index, an ORF is classified as a coding region if AZ > 0 and as a non-coding region otherwise. As a result, 1581 coding genes and 1113 non-coding genes were determined in the Aeropyrum pernix genome.
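The quantities mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes the standard Z-curve definition, in which the n-th point is built from cumulative base counts: x_n = (A_n + G_n) - (C_n + T_n), y_n = (A_n + C_n) - (G_n + T_n), z_n = (A_n + T_n) - (G_n + C_n). The function names (`z_curve`, `base_composition`, `classify_by_az`) are hypothetical and are not taken from the thesis or from the Z-plotter software. The abstract does not give the formula for the AZ index, so the sketch applies only the stated AZ > 0 decision rule to scores that are assumed to be already computed.

```python
from collections import Counter


def _normalize(seq):
    """Upper-case the sequence and map RNA 'U' to 'T' so ncRNA input also works."""
    return seq.upper().replace("U", "T")


def z_curve(seq):
    """Return the cumulative Z-curve points (x_n, y_n, z_n) for n = 1..len(seq)."""
    a = c = g = t = 0
    points = []
    for base in _normalize(seq):
        if base == "A":
            a += 1
        elif base == "C":
            c += 1
        elif base == "G":
            g += 1
        elif base == "T":
            t += 1
        else:
            raise ValueError(f"unexpected base: {base!r}")
        x = (a + g) - (c + t)  # purine vs. pyrimidine
        y = (a + c) - (g + t)  # amino vs. keto
        z = (a + t) - (g + c)  # weak vs. strong hydrogen bonding
        points.append((x, y, z))
    return points


def base_composition(seq):
    """Return (fraction of each base, G-C content) for a non-empty sequence."""
    seq = _normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    counts = Counter(seq)
    fractions = {b: counts.get(b, 0) / len(seq) for b in "ACGT"}
    return fractions, fractions["G"] + fractions["C"]


def classify_by_az(az_scores):
    """Apply the abstract's rule: coding if AZ > 0, otherwise non-coding.

    az_scores maps an ORF name to a precomputed AZ index (assumed input;
    the AZ computation itself is not reproduced here).
    """
    return {
        name: "coding" if score > 0 else "non-coding"
        for name, score in az_scores.items()
    }


if __name__ == "__main__":
    print(z_curve("ATGC"))
    print(base_composition("ATGC"))
    print(classify_by_az({"orf_1": 0.3, "orf_2": -0.1}))
```

Plotting the three coordinates returned by `z_curve` against the position n gives curves of the kind a Z-curve viewer draws; the ORF names and scores in the usage lines are made-up placeholders.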
Keywords/Search Tags: Z-curve, non-coding region, clustering algorithm, gene recognition, ORF, AZ index