Font Size: a A A

Data integration and applications of functional gene networks in Drosophila melanogaster

Posted on:2010-04-07Degree:Ph.DType:Thesis
University:Indiana UniversityCandidate:Costello, James ChristopherFull Text:PDF
GTID:2440390002974700Subject:Biology
Abstract/Summary:
Understanding the function of every gene in the genome is a central goal in the biological sciences. This includes full characterization of a genes phenotypic effects, molecular interactions, the evolutionary forces that shape its function(s), and how these functions interrelate. Despite a long history and considerable effort to understand all genes in a genome, mainly focused on "model" organisms, which include bacteria, yeast, worm, flies, and mouse, we are still far from accomplishing this task. For example, the experimentally amenable eukaryotic organism, Saccharomyces cerevisiae (yeast), has a limited set of roughly 6,000 genes, yet we still do not know a single function for over 1,000 of these genes. This problem only gets worse for metazoan organisms. For the fruit fly, Drosophila melanogaster, a major biomedical model organism, experimental evidence exists for only &ap 40% of its roughly 15,000 genes. Both new and improving genomics technologies can help us fully characterized gene function. These assay methods have the advantage of being high-throughput, but can be challenging to interpret. In this thesis, I take a computational approach to address the growing problem of managing, integrating, and analyzing large-scale genomics data for Drosophila melanogaster. This dissertation demonstrates several complementary and novel findings. First, I have shown that disparate sources of data can be integrated together to derive a network with richer information than any individual data source for fly. This result is in support of similar work done in yeast, worm, mouse, and human. Second, I have demonstrated several ways in which the integrated gene networks can be utilized, which include predicting function onto unannotated genes, providing evidence that strong functional relationships tend to show signatures of purifying selection, and using the networks to reanalyze and derive new insight from previously published genome-scale datasets. Third, I have shown that the computational predictions made using the gene networks are reliable through machine learning techniques. Lastly, I have shown that gene networks and experimental data can be used to inform how biological processes are related.
Keywords/Search Tags:Gene, Data, Function, Drosophila
Related items