Font Size: a A A

A Clustering Rule Based Approach for Classification Problems

Posted on:2011-09-13Degree:Ph.DType:Dissertation
University:Auburn UniversityCandidate:Williams, Philicity KapryelleFull Text:PDF
GTID:1448390002466723Subject:Computer Science
Abstract/Summary:PDF Full Text Request
Today's data storage and collection abilities have allowed the accumulation of enormous amounts of data. Data mining can be a useful tool in transforming these large amounts of raw data into useful information. Predictive modeling is a very popular area in data mining. The results of these type tasks can contain helpful information that can be used in decision making. Problems arise when the data sets that are used to build these models are not as complete (e.g. erroneous/missing values) as the data used to evaluate the model. Rule based classifiers are widely used and accepted type of predictive model. We present a method to reduce the severity of the effects of missing data on the performance of rule base classifiers using divisive data clustering. The Clustering Rule based Approach (CRA) clusters the original training data and builds a separate rule based model on the cluster wise data. The individual models are combined into a larger model and evaluated against test data. We evaluate the effects of the missing attribute information for ordered and unordered rule sets. We experimentally show that the collective model is less affected by missing attribute information when the test data has missing attribute values.
Keywords/Search Tags:Data, Rule, Missing attribute, Model, Clustering, Information
PDF Full Text Request
Related items