Numerical Characterization Of Protein Sequences Based On The Generalized Pseudo Amino Acid Composition

Posted on: 2018-04-19
Degree: Master
Type: Thesis
Country: China
Candidate: X Q Li
GTID: 2310330515998884
Subject: Computational Mathematics
Abstract/Summary:
With the development and completion of the genome projects for humans and other organisms, the amount of biological data is growing exponentially, and the focus of biology is shifting from accumulating data to analyzing and interpreting it. Bioinformatics has emerged in response. Techniques for comparing and analyzing biological sequences play an increasingly important role in bioinformatics, and a key step in developing them is finding an appropriate way to represent a biological sequence. This thesis studies a method for formulating a protein sequence numerically. The main contents are as follows: On the basis of three physicochemical properties of amino acids, a protein primary sequence is reduced to a six-letter sequence, and a set of elements reflecting global and local sequence-order information is then extracted. Combining these elements with the frequencies of the 20 native amino acids, a (21+?) dimensional vector is constructed to characterize the protein sequence. The utility of the proposed approach is illustrated by phylogenetic analysis and identification of DNA-binding proteins...
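
The general shape of such a feature vector can be sketched in Python. This is only an illustration under stated assumptions, not the method of the thesis: the abstract names neither the three physicochemical properties nor the six classes, so the grouping in GROUPS is hypothetical, and adjacent-pair frequencies of the reduced sequence stand in for the unspecified sequence-order elements. The function names are likewise invented for this sketch.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 native amino acids

# HYPOTHETICAL six-class grouping, for illustration only; the abstract
# does not say which classes the thesis actually uses.
GROUPS = {"a": "AVLIMC", "b": "FWYH", "c": "STNQ",
          "d": "KR", "e": "DE", "f": "GP"}
TO_REDUCED = {aa: g for g, members in GROUPS.items() for aa in members}


def reduce_sequence(seq):
    """Map a protein sequence onto the six-letter alphabet."""
    return "".join(TO_REDUCED[aa] for aa in seq)


def composition(seq):
    """Frequencies of the 20 amino acids, in AMINO_ACIDS order."""
    counts = Counter(seq)
    return [counts[aa] / len(seq) for aa in AMINO_ACIDS]


def adjacent_pair_frequencies(reduced):
    """Stand-in for sequence-order elements: frequencies of the 36
    possible adjacent letter pairs in the reduced sequence."""
    pairs = Counter(reduced[i:i + 2] for i in range(len(reduced) - 1))
    total = max(len(reduced) - 1, 1)  # avoid division by zero
    letters = sorted(GROUPS)
    return [pairs[x + y] / total for x, y in product(letters, repeat=2)]


def feature_vector(seq):
    """Concatenate composition with the stand-in order features."""
    return composition(seq) + adjacent_pair_frequencies(reduce_sequence(seq))
```

Under these assumptions every protein, whatever its length, maps to a vector of fixed dimension (here 20 + 36 = 56), so sequences can be compared with an ordinary vector distance without alignment.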
Keywords/Search Tags: generalized pseudo amino acid composition, numerical characterization, phylogenetic analysis, identification of DNA-binding proteins