Font Size: a A A

Optimizing parameters in fuzzy k-means for clustering microarray data

Posted on:2006-03-30Degree:M.ScType:Thesis
University:University of Windsor (Canada)Candidate:Yang, WeiFull Text:PDF
GTID:2458390008960250Subject:Computer Science
Abstract/Summary:PDF Full Text Request
Rapid advances of microarray technologies are making it possible to analyze and manipulate large amounts of gene expression data. Clustering algorithms, such as hierarchical clustering, self-organizing maps, k-means clustering and fuzzy k-means clustering, have become important tools for expression analysis of microarray data. However, the need of prior knowledge of the number of clusters, k, and the fuzziness parameter, b, limits the usage of fuzzy clustering. Few approaches have been proposed for assigning best possible values for such parameters.; In this thesis, we use simulated annealing and fuzzy k-means clustering to determine the optimal parameters, namely the number of clusters, k, and the fuzziness parameter, b. To assess the performance of our method, we have used synthetic and real gene experiment data sets.; To improve our approach, two methods, searching with Tabu List and Shrinking the scope of randomization, are applied. Our results show that a nearly-optimal pair of k and b can be obtained without exploring the entire search space.
Keywords/Search Tags:Clustering, Fuzzy k-means, Microarray, Data, Parameters
PDF Full Text Request
Related items