Font Size: a A A

Clustering Based On Space Transformation

Posted on:2020-01-05Degree:MasterType:Thesis
Country:ChinaCandidate:L Y HuFull Text:PDF
GTID:2428330626951286Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Clustering based on Space Transformation(CST)is a novel framework on unified clustering.It enhances the effectiveness and universality of a clustering algorithm on various datasets.Meanwhile,it avoids the problem of selecting suitable clustering algorithms and corresponding parameters for unknown datasets.CST uses a new similarity matrix transformed from the original Euclidean distance matrix of a dataset by a nonlinear mapping which is explicit and interpretable for clustering algorithms.Furthermore,under the tranfrormend space of the similarity,more evidences of cluster structures are extracted,which can result in more accurate clustering results by traditional clustering algorithms.Surprisingly,it is also robust to outliers.In this paper,the Normalized Cut is used to show the effectiveness and universality of CST.As a starting point,I researched on two kind of CST and try to figure out the essence part of validity in clustering.To choose a specific new similarity from a candidate set of CSTs,a new proposed internal validity index is uesed.The key of my studies is on designing a good internal validity index.Finally,some clustering mechanisms are involved in clusters or clustering,contributing to a new internal validity index called CVDD.Experimental results show that CVDD outperforms some classic ones and can cope with challenging datasets such as non-spherical clusters,density-separated clusters and datasets with outliers.
Keywords/Search Tags:unified clustering, similarity-based learning, non-spherical clusters, cluster internal validity index, clustering mechanism
PDF Full Text Request
Related items