Font Size: a A A

The Genome Sequence Cgr Graphics Multifractal Analysis And Application

Posted on:2006-02-19Degree:DoctorType:Dissertation
Country:ChinaCandidate:W J FuFull Text:PDF
GTID:1114360155460555Subject:Biomedical engineering
Abstract/Summary:PDF Full Text Request
Bioinformatics represents a new, growing area of science that uses mathematics, informatics and computational approaches to answer biological questions. Recently years bioinformatics develops greatly with the molecule sequences data's rapidly increasing, and the genome sequence analysis mainly involving gene prediction and molecule evolution, becomes its primary research field. At present lots of arithmetic and software are applied in genome sequence analysis, also more and more perfect sequence analysis methods accelerate the development of bioinformatics. As a strong data processing technique, statistics method gains more attention in genome sequence analysis. Chaos Game Representation (CGR) proposed by Jeffrey basing on iterated function system, represents the word statistics distribution in sequence as the fractal feature of image, therefore the CGR method can be regard as a very handy statistics method of genome sequence analysis. CGR method has some advantages such as independent of long range arrangement and sequence scale and quick computation speed. On the contrary current gene prediction method relys on sequence model, multiple sequence alignment is in the limitation of gene rearrangements and computation complexity, and the different scale genomes are difficult to be compared. Despite of it is exciting to solve the difficulty by CGR method, however CGR method is not widely applied in the bioinformatics till now for its lack of strict mathematics description.Multifrctal theory is the hotspot of recently fractal theory research, it describe the fractal structure in different levels by a spectrum function, then study the distribution rule of characteristic parameters' probability measure by statistics physical method. Recently two years the idea has be proposed that the multifrctal theory can be used in CGR method, but this idea is not be developed to a detailed analysis and application till now. According to this background, we carried out the genome sequence CGR image's multifractal analysis, and discussed the application in important bioinformatics research.First, the capability dimension and informatics dimension of genome sequence's CGR image are calculated, from which it is found that the dimensions is steady in a range of word length. Also the dimensions is changeless when the sequence length is above 5 M, so the 5 M length sequence can represent the large whole human...
Keywords/Search Tags:Bioinformatics, Gene prediction, Molecular evolution, CGR image, Multiple sequence alignment, Gene rearrangements, Complexity, Fractal, Multifractal, Probability set, Scale invariance, General dimension spectrum, Multifractal spectrum
PDF Full Text Request
Related items