2012 16th Panhellenic Conference on Informatics (2012)
Piraeus, Greece
Oct. 5, 2012 to Oct. 7, 2012
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PCi.2012.69
Data reduction is very important, especially when using the k-NN classifier on large datasets. Many prototype selection and generation algorithms have been proposed that aim to condense the initial training data as much as possible while keeping classification accuracy at a high level. The Prototype Selection by Clustering (PSC) algorithm is one of them and is based on a cluster generation procedure. Contrary to many other prototype selection and generation algorithms, its main goal is fast execution of the data reduction procedure rather than a high reduction rate. In this paper, we demonstrate that the reduction rate and the classification accuracy of PSC can be improved by generating a larger number of clusters. Moreover, we compare the performance of PSC with two state-of-the-art algorithms, one selection and one generation, using six real-life datasets. The experimental results indicate that the classification performance of PSC is comparable with that of its competitors when using many clusters.
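The abstract above only states that PSC is based on a cluster generation procedure, so the following is a minimal sketch of that general idea and not the authors' implementation. It assumes k-means as the clustering step, keeps the instance nearest to the centroid in single-class clusters, and keeps pairs of neighboring instances from different classes in mixed clusters. These selection rules, and the names `kmeans`, `psc_like_reduce`, and `predict_1nn`, are illustrative assumptions.

```python
import random
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of tuples."""
    n = len(points)
    return tuple(sum(col) / n for col in zip(*points))


def kmeans(points, n_clusters, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns non-empty clusters as lists of indices."""
    rng = random.Random(seed)
    centroids = [points[i] for i in rng.sample(range(len(points)), n_clusters)]
    clusters = []
    for _ in range(n_iter):
        clusters = [[] for _ in range(n_clusters)]
        for i, p in enumerate(points):
            nearest = min(range(n_clusters), key=lambda c: sq_dist(p, centroids[c]))
            clusters[nearest].append(i)
        # Recompute centroids; an empty cluster keeps its previous centroid.
        centroids = [mean([points[i] for i in members]) if members else centroids[c]
                     for c, members in enumerate(clusters)]
    return [members for members in clusters if members]


def psc_like_reduce(points, labels, n_clusters, seed=0):
    """Return sorted indices of the retained prototypes (assumed selection rules)."""
    selected = set()
    for members in kmeans(points, n_clusters, seed=seed):
        counts = Counter(labels[i] for i in members)
        if len(counts) == 1:
            # Single-class cluster: one representative near the centroid suffices.
            center = mean([points[i] for i in members])
            selected.add(min(members, key=lambda i: sq_dist(points[i], center)))
            continue
        # Mixed cluster: keep instances lying close to another class.
        majority = counts.most_common(1)[0][0]
        major = [i for i in members if labels[i] == majority]
        for i in members:
            if labels[i] == majority:
                continue
            same = [j for j in members if labels[j] == labels[i]]
            border_major = min(major, key=lambda j: sq_dist(points[j], points[i]))
            border_minor = min(same, key=lambda j: sq_dist(points[j], points[border_major]))
            selected.update((border_major, border_minor))
    return sorted(selected)


def predict_1nn(x, points, labels, selected):
    """Classify x with 1-NN, searching only the retained prototypes."""
    nearest = min(selected, key=lambda i: sq_dist(points[i], x))
    return labels[nearest]
```

In this sketch, `n_clusters` is the parameter the abstract refers to when it reports results for a larger number of clusters. The effect of that choice on reduction rate and accuracy depends on the data and would need to be measured, as the paper does on its six datasets.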
Prototypes, Clustering algorithms, Accuracy, Training, Measurement, Partitioning algorithms, Training data, Prototype Selection and Generation, Classification, Clustering, k-Nearest Neighbors, Data Reduction
Stefanos Ougiaroglou, Georgios Evangelidis, "Fast and Accurate k-Nearest Neighbor Classification Using Prototype Selection by Clustering", 2012 16th Panhellenic Conference on Informatics, pp. 168-173, 2012, doi:10.1109/PCi.2012.69