Font Size: a A A

Similarity Analysis Of ND5 Protein Sequence Based On Discrete Wavelet And Fractal Dimension

Posted on:2021-03-21Degree:MasterType:Thesis
Country:ChinaCandidate:X C DangFull Text:PDF
GTID:2370330611481026Subject:Master of Computer Technology
Abstract/Summary:PDF Full Text Request
With the increasing number of biological sequences in gene database and the improvement of sequencing methods,more and more scholars domestic and abroad focus on the study of biological sequence alignment.The analysis of proteins from different species has a better visibility of protein structure,which plays a great role in the research of drug anticancer,among them,the similarity comparison of protein sequences has become one of the important tasks in the field of protein research,helping us to predict and analyze the structure and function of proteins.In order to make the similarity comparison of protein sequence more accurate and effective,a new method is proposed.In this paper,the specific research is as follows:1.In this paper,the mitochondrial NADH dehydrogenase(ND5)sequences of 9 species in NCBI database were used as test sequences to standardize the 10 attribute data of amino acids,including hydrophobicity,p Ka,p Kb,p I,residue weight,VSC,P1,P2,SASA and NCISC.2.Discrete wavelet transform(DWT)used to decompose the digital signal containing biological information.Based on wavelet decomposition,Higuchi algorithm used to study the fractal characteristics of the ND5 sequence protein primary structure.Through the analysis and calculation,the distance matrix between different proteins is obtained,and the existing clustering methods are used to cluster,and good analysis results are obtained.3.Based on the analysis of protein sequences with various amino acid properties,combined with the role and advantages of discrete wavelet and fractal dimension in the analysis of protein sequence similarity,the method can be applied to other protein sequences,which shows the effectiveness of the application of this method.After verifying the effectiveness of the method,by comparing different clustering methods,we can select theappropriate clustering method,and get more accurate and comprehensive results of phylogenetic tree and protein sequence similarity analysis.To sum up,this paper achieved the method of combining DWT and fractal dimension with multiple properties of protein to study protein sequence similarity comparison.The model is more accurate and comprehensive than the existing models,which verifies the reliability of this method.
Keywords/Search Tags:Protein sequence, Similarity analysis, normalization, DWT, fractal dimension
PDF Full Text Request
Related items