Font Size: a A A

Distance Measures in Bioinformatics

Posted on:2016-11-05Degree:Ph.DType:Dissertation
University:Drexel UniversityCandidate:Xiong, FeiyuFull Text:PDF
GTID:1478390017467035Subject:Biology
Abstract/Summary:
any bioinformatics applications rely on the computation of similarities between objects. Distance and similarity measures applied to vectors of characteristics are essential to problems such as classification, clustering and information retrieval.;This study explores the usefulness of distance and similarity measures in several bioinformatics applications. These applications are in two categories.;(1) Estimation of the adverse reaction severity of unknown pharmaceutical treatments, based on the severity of known treatments, in order to provide guidance for testing of the unknown treatments in clinical trials.;(2) Classification of cancer tissue types and estimation of cancer stages, based on high-dimensional microarray data, in order to support clinical decisions making.;To address the first category, we studied several clustering and classification approaches for binary severity estimation of Cytokine Release Syndrome (CRS). We developed a Severity Estimation using Distance Metric Learning (SE-DML) approach to get graded severity estimation. With binary estimation we were able to identify treatments that caused the most severe response and then built prediction models for CRS. Using the SE-DML approach, we evaluated four known data sets and showed that SE-DML outperformed other widely used methods on these data sets.;For the second category, we presented Kernelized Information-Theoretic Metric Learning (KITML) algorithms that optimize distance metrics and effectively handle high-dimensional data. This learned metric by KITML is used to improve the performance of...
Keywords/Search Tags:Distance, Measures, Data
Related items