Font Size: a A A

Genomic, regulatory and functional dynamics of the duplication process

Posted on:2009-01-16Degree:Ph.DType:Thesis
University:University of Southern CaliforniaCandidate:Zhang, KangyuFull Text:PDF
GTID:2443390005458075Subject:Biology
Abstract/Summary:
Duplication is one of the most important mechanisms for evolving gene or genome complexity. Exploring how duplicated genes or genomic regions evolve has become the cornerstone of modern evolutionary theory. In Eukaryotes, over 30% of all genes are confirmed to derive directly from duplication. In the era of genomic data explosion since year two thousand, the flood of complete genome sequences from multiple species significantly facilitated its study. For example, the long debated two round hypothesis of genome duplication in the early stage of vertebrate evolution has been demonstrated via comparing several invertebrate and vertebrate genomes.;In my study I focus on the duplication process from three different aspects: DNA sequence, transcriptional regulation, and gene function. I emphasize the dynamics of the duplication process, e.g. how duplication played a role in the adaptation of the species.;At the DNA sequence level, I modeled how duplicated microsatellites evolve. When a microsatellite locus is duplicated in a diploid organism, a single pair of PCR primers may amplify as many as four distinct alleles. To study the evolution of a duplicated microsatellite, a coalescent model with symmetric stepwise mutation is considered. Conditional on the time of duplication and mutation rate, I computed the probabilities for a sampled diploid individual to amplify one, two, three, or four distinct alleles with one pair of microsatellite PCR primers. These probabilities are then studied to examine the nature of their dependence on the duplication time and the mutation rate. The results can be useful to interpret genetic variation at microsatellite loci in species with a very recent history of gene duplication.;At the transcriptional regulation level, I studied how cis-regulatory elements evolve via a comparative genomics approach. Based on the genomic sequence of 15 fungi species diverged between S.cerevisiae and S.pombe, I determined the presence or absence of ancestral pre- and post-duplication cis-regulatory binding sites, by screening the upstream sequence of the corresponding orthologous genes. This study is the first comprehensive data assessment of how cis-regulation elements evolve in duplicated genes. It demonstrates that the dominant fate of cis-regulatory elements after gene or genome duplication is the emergence of new binding sites, along with asymmetrical loss of ancestral elements in one paralog. This is contrary to previous studies arguing that the distribution of ancestral binding sites into two paralogous duplicates may be more common. My study has implications in further understanding the causality of the complexity in extant transcriptional regulatory network.;At the function level, I proposed a gene ontology based approach to measure functional similarity between gene products across different species. By analyzing orthologous and paralogous duplicated genes in species S.cerevisiae and S.pombe, I discovered heterogeneous rates of functional divergence across different combinations of ontologies and gene sequences. In particular, functional similarities between paralogous genes tend to be higher than in orthologous genes, suggesting convergent evolution among duplicated genes. My method can be further extended to other annotated eukaryotic species. It also has implications for all other ontology based classification systems.
Keywords/Search Tags:Duplication, Genomic, Duplicated genes, Species, Functional, Genome
Related items