Font Size: a A A

Analysis and Integration of Biological Data for Metabolic Pathway Network Reconstruction by Computational System Biology Approach

Posted on:2016-09-26Degree:Ph.DType:Thesis
University:Sardar Patel University (India)Candidate:Dave, Kirtan PrakashchandraFull Text:PDF
GTID:2470390017481253Subject:Computer Science
Abstract/Summary:
Post-genomic molecular biology embodies high-throughput experimental techniques and hence is a data-rich field. The goal of this research work is to develop bioinformatics methods to utilize publicly available biological data of green algae in order to produce new metabolic pathway knowledge and to aid mining of newly generated data. As an example of knowledge or hypothesis generation from network creation to network analysis, consider function prediction of biological molecules. Assignment of identification of enzyme function is a non-trivial task owing to the fact that one or more potential enzymatic protein may lead to function identification in pathway metabolic network and at the same time involvement of enzymes in different biological processes; depending on their centrality value in particular pathway biological system. The availability of different databases and various approaches of genome annotation will lead to some unorganized architecture of databases, as user point of view organization of annotation information must be data comprehensive and organism specific along with ease in mining without Internet support. The multiple biological data of specific organism are available in scattered format and there is a lack of integrative approach among these biological data. Such genome annotation databases lack in providing the integrative view of different biological entities- especially enzymes and metabolites in a specific network. Therefore, we need to find out different ways to represent biological data in network architecture. Here we apply data integration approach to provide rich representation that enables pathway names based text mining of biological data in terms of integrated networks and conceptual spaces. A new tool called MetAlgNet has been developed during this research work, which follows integrative approach. The publicly available green algae genome annotated data can be used to aid mining of important biological enzymes in metabolic networks. We developed an integrative bioinformatics approach that utilizes publicly available knowledge of enzyme-metabolites interactions, network topological analysis like betweenness, closeness and degree for assigning node importance with quantitative values. Unidentified protein must be assigned to a particular biological protein with the help of Support Vector Machine (SVM) methods. We have reconstructed a phylogenetic tree from a set of specific metabolic enzyme protein sequences with the help of ete2 module in our tool. We applied this approach to create a random network from repetitive observations of the biological data about 1. Chlamydomonas reinhardtii. 2. Ostreococcus lucimarinus. 3. Ostreococcus tauri and 4. Volvox carteri, which are stored in a standalone annotation database. The application of our software revealed importance of role of potential enzymes in biological functions in view of network centrality values, which were calculated by various algorithms. The results provided in this thesis indicate that integration of heterogeneous biological data facilitates advanced mining of data to create metabolic pathway networks. The methods can be applied for gaining insight into functions of enzymes, metabolites and other molecules, as well as for offering interpretation of functional evolution of metabolites with help of topological analysis and reconstruction of phylogenetic tree from sequence data.
Keywords/Search Tags:Data, Network, Metabolic pathway, Approach, Integration
Related items