
Weighted K-NN Algorithm And Its Application

Posted on: 2006-04-03
Degree: Master
Type: Thesis
Country: China
Candidate: H Chen
Full Text: PDF
GTID: 2120360155950335
Subject: Basic mathematics
Abstract/Summary:PDF Full Text Request
The performance of the K-nearest neighbor (K-NN) classification algorithm depends on the choice of distance metric. The conventional K-NN algorithm usually takes the Euclidean distance as the similarity measure, which treats all attributes equally. When feature-weight parameters are introduced into the distance formula, classification performance depends on the weight values and can therefore be improved by adjusting them. This thesis introduces an algorithm for learning feature weights to improve classification accuracy; mathematically, it corresponds to a linear transformation of a set of points in Euclidean space. At the same time, different near neighbors play different roles in determining the final class of a test sample. We not only learn a weight for each feature, but also weight the contribution of each of the k neighbors according to its distance to the test sample, giving greater weight to closer neighbors; this further improves classification accuracy. For learning the value of K in K-NN, this paper proposes a validity function for judging clusterings and applies it to K-nearest neighbor classification. It then introduces the notion of the "generalization capability of a case" into K-nearest neighbor classification: cases with better generalization capability are retained as representative cases, while redundant cases found within their coverage are removed. This yields a smaller but almost complete training set, and consequently reduces the complexity of the nearest-neighbor search.
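The two weighting ideas described above can be illustrated with a minimal sketch (not the thesis's exact algorithm: the procedures for learning the feature weights and for selecting representative cases are omitted here, and the feature weights are simply passed in as a given vector):

```python
import numpy as np
from collections import defaultdict

def weighted_knn_predict(X_train, y_train, x, k=3, feature_weights=None):
    """Classify x by a feature-weighted, distance-weighted K-NN vote.

    feature_weights scales each attribute inside the Euclidean metric,
    which is equivalent to a diagonal linear transformation of the space.
    """
    X_train = np.asarray(X_train, dtype=float)
    x = np.asarray(x, dtype=float)
    if feature_weights is None:
        feature_weights = np.ones(X_train.shape[1])
    w = np.asarray(feature_weights, dtype=float)

    # Feature-weighted Euclidean distance:
    # d(x, x_i) = sqrt( sum_j  w_j * (x_j - x_ij)^2 )
    dists = np.sqrt((w * (X_train - x) ** 2).sum(axis=1))

    # Indices of the k nearest neighbors.
    idx = np.argsort(dists)[:k]

    # Distance-weighted vote: closer neighbors contribute more.
    votes = defaultdict(float)
    for i in idx:
        votes[y_train[i]] += 1.0 / (dists[i] + 1e-9)  # guard against d = 0
    return max(votes, key=votes.get)

X_train = [[0, 0], [0, 1], [5, 5], [5, 6]]
y_train = ["a", "a", "b", "b"]
print(weighted_knn_predict(X_train, y_train, [0.2, 0.5], k=3))
```

With `k=3`, the query point picks up two nearby "a" cases and one distant "b" case; the inverse-distance vote lets the two close "a" neighbors dominate, so the prediction is "a" even though a plain majority over a poorly chosen neighborhood could be swayed by remote cases.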
Keywords/Search Tags:K-nearest neighbor, K-means, Feature weights, Distance-Weighted, Generalization Capability