Font Size: a A A

Statistical issues and novel strategies for expression quantitative trait loci mapping

Posted on:2012-09-12Degree:Ph.DType:Dissertation
University:Michigan State UniversityCandidate:Li, ShaoyuFull Text:PDF
GTID:1453390011454477Subject:Statistics
Abstract/Summary:
Gene regulation is thought to play a pivotal role in determining physiological trait variability by promoting/reducing the expression of functional genes directly or indirectly related to the phenotype. Expression quantitative trait loci (eQTL) mapping studies hold great promise in disentangling gene regulations and have become a popular research area recently. In this dissertation, I explore several statistical strategies, which are applied to eQTL mapping studies, aimed to have a better understanding on the biological mechanism of gene regulation.;The major goal of eQTL studies is to identify genomic regions that are likely to regulate gene expressions. Given that genes function in a network basis, we consider the scenario that a genetic perturbation could lead to a cascade effects on the transcription of multiple genes which belongs to a gene set, e.g., network/pathway. We develop a statistical procedure which incorporates prior biological gene set information (e.g. KEGG pathway, GO terms) into eQTL mapping framework to elucidate gene regulation from a systems biology perspective. Pathway regulators which mediate the expression of genes in a pathway are detected by modeling multiple gene expressions as a multivariate response to test the joint variation changes among different genotype categories. We apply the proposed approach to a yeast eQTL data set. Novel pathway regulators and regulation hotspots are identified.;Currently, most eQTL mapping studies focus on single marker analysis. However, the variability of gene expression may be caused by the regulation of a set of variants that belong to a common genetic system, and individually only with small or moderate effect. To study the roles of genetic systems in regulating gene expressions, we propose a statistical p-value combination approach to combine individual signals across a pre-defined genetic system to form an overall signal, while considering correlations between genetic variants in the system. Results for simulation studies and the application to the yeast eQTL data are presented.;As part of the DNA sequence variation, gene-gene interaction or epistasis has been ubiquitously observed in nature where its role in shaping the development of an organism has been broadly recognized. Investigating genetic interactions related to mRNA expression is an important step on the path to elucidating the genetic architecture underlying gene expression and could provide valuable functional interpretation of gene regulation. As genes are the functional units in living organisms, we conceptually propose a gene-centric gene-gene interaction framework for genome-wide epistasis detection. Multiple genetic markers (e.g. SNPs) in a gene are modeled simultaneously as a testing unit. We develop a model-based kernel machine approach for detecting pairwise gene-gene interactions. Simulation study and applications of the proposed method to the yeast eQTL data indicate its feasibility to eQTL mapping. We further extend the model-based kernel machine method to binary phenotypic outcomes. Our models provide quantitative and testable framework for assessing the interplay between gene expression and gene regulation, and will have great implications for elucidating the genetic architecture of gene expression.
Keywords/Search Tags:Expression, Gene, Trait, Mapping, Statistical, Yeast eqtl data, Quantitative
Related items