Subscribe

Issue No.04 - April (2010 vol.22)

pp: 465-478

Upavan Gupta , University of South Florida, Tampa

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2009.110

ABSTRACT

Data and object clustering techniques are used in a wide variety of scientific applications such as biology, pattern recognition, information systems, etc. Traditionally, clustering methods have focused on optimizing a single metric, however, several multidisciplinary applications such as robot team deployment, ad hoc networks, facility location, etc., require the simultaneous examination of multiple metrics during clustering. In this paper, we propose a novel approach for spatial data clustering based on the concepts of microeconomic theory, which can simultaneously optimize both the compaction and the equipartitioning objectives. The algorithm models a multistep, normal form game consisting of randomly initialized clusters as players that compete for the allocation of data objects from resource locations. A Nash-equilibrium-based methodology is used to derive solutions that are socially fair for all the players. After each step, the clusters are updated using the KMeans algorithm, and the process is repeated until the stopping criteria are satisfied. Extensive simulations were performed on several real data sets as well as artificially synthesized data sets to evaluate the efficacy of the algorithm. Experimental results indicate that the proposed algorithm yields significantly better results as compared to the traditional algorithms. Further, the proposed algorithm yields a high value of fairness, a metric that indicates the quality of the solution in terms of simultaneous optimization of the objectives. Also, the sensitivity of the various design parameters on the performance of our algorithm is analyzed and reported.

INDEX TERMS

Equipartitioning, compaction, game theory, clustering, Nash equilibrium.

CITATION

Upavan Gupta, "A Game Theoretic Approach for Simultaneous Compaction and Equipartitioning of Spatial Data Sets",

*IEEE Transactions on Knowledge & Data Engineering*, vol.22, no. 4, pp. 465-478, April 2010, doi:10.1109/TKDE.2009.110REFERENCES

- [1] A. Amis and R. Prakash, "Load-Balancing Clusters in Wireless Ad Hoc Networks,"
Proc. Third IEEE Symp. Application-Specific Systems and Software Eng. Technology, pp. 25-32, 2000.- [3] P. Berkhin, "Survey of Clustering Data Mining Techniques," technical report, Accrue Software, vol. 10, pp. 92-1460, 2002.
- [4] Y. Chien,
Interactive Pattern Recognition. M. Dekker, 1978.- [7] R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun, "Game Theoretic Control for Robot Teams,"
Proc. IEEE Int'l Conf. Robotics and Automation (ICRA '05), pp. 1163-1169, 2005.- [8] M. Ester, H. Kriegel, J. Sander, and X. Xu, "A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise,"
Proc. Second Int'l Conf. Knowledge Discovery and Data Mining, pp. 226-231, 1996.- [9] F. Forgó, J. Szép, and F. Szidarovszky,
Introduction to the Theory of Games: Concepts, Methods, Applications. Kluwer Academic Publishers, 1999.- [12] D. Grosu and A. Chronopoulos, "A Game-Theoretic Model and Algorithm for Load Balancing in Distributed Systems,"
Proc. Parallel and Distributed Processing Symp., pp. 146-153, 2002.- [13] D. Grosu and A. Chronopoulos, "Algorithmic Mechanism Design for Load Balancing in Distributed Systems,"
IEEE Trans. Systems, Man and Cybernetics, Part B, vol. 34, no. 1, pp. 77-84, Feb. 2004.- [14] S. Guha, A. Meyerson, and K. Munagala, "Hierarchical Placement and Network Design Problems,"
Proc. 41st Ann. Symp. Foundations of Computer Science, pp. 603-612, 2000.- [15] U. Gupta and N. Ranganathan, "A Microeconomic Approach to Multi-Robot Team Formation,"
Proc. IEEE/RSJ Int'l Conf. Intelligent Robots and Systems, pp. 3019-3024, 2007.- [18] J. Handl and J. Knowles, "Evolutionary Multiobjective Clustering,"
Proc. Eighth Int'l Conf. Parallel Problem Solving from Nature, pp. 1081-1091, 2004.- [20] R. Jain, D. Chiu, and W. Hawe, "A Quantitative Measure of Fairness and Discrimination for Resource Allocation in Shared Computer System," DEC-TR-301, Eastern Research Lab, Digital Equipment Corporation, Sept. 1984.
- [21] S. Kirkpatrick, C. GelattJr., and M. Vecchi, "Optimization by Simulated Annealing,"
Science, vol. 220, no. 4598, pp. 671-680, 1983.- [23] Y. Kwok, S. Song, and K. Hwang, "Selfish Grid Computing: Game-Theoretic Modeling and NAS Performance Results,"
Proc. Int'l Symp. Cluster Computing and the Grid (CCGrid), 2005.- [25] A. Lazar and N. Semret, "A Resource Allocation Game with Application to Wireless Spectrum," technical report, Columbia Univ., 1996.
- [26] J. MacQueen, "Some Methods for Classification and Analysis of Multivariate Observations,"
Proc. Fifth Berkeley Symp. Math. Statistics and Probability, vol. 1, pp. 281-297, 1967.- [27] R. McKelvey, A. McLennan, and T. Turocy, "Gambit: Software Tools for Game Theory," http:/gambit.sourceforge.net, The Gambit Project, 2002.
- [28] R. Murphy, "Human-Robot Interaction in Rescue Robotics,"
IEEE Trans. Systems, Man, and Cybernetics: Part C: Applications and Rev., vol. 34, no. 2, pp. 138-153, May 2004.- [29] F. Murtagh, "A Survey of Recent Advances in Hierarchical Clustering Algorithms,"
Computer J., vol. 26, no. 4, pp. 354-359, 1983.- [30] A. Murugavel and N. Ranganathan, "A Game Theoretic Approach for Power Optimization During Behavioral Synthesis,"
IEEE Trans. Very Large Scale Integration Systems, vol. 11, no. 6, pp. 1031-1043, Dec. 2003.- [32] E. Rasmusen,
Games and Information: An Introduction to Game Theory. Blackwell Publishers, 2001.- [33] S. Saha, S. Sur-Kolay, S. Bandyopadhyay, and P. Dasgupta, "Multiobjective Genetic Algorithm for K-Way Equipartitioning of a Point Set with Application to CAD-VLSI,"
Proc. Ninth Int'l Conf. Information Technology, pp. 281-284, 2006.- [34] N. Sato, F. Matsuno, T. Yamasaki, T. Kamegawa, N. Shiroma, and H. Igarashi, "Cooperative Task Execution by a Multiple Robot Team and Its Operators in Search and Rescue Operations,"
Proc. IEEE/RSJ Int'l Conf. Intelligent Robots and Systems (IROS '04), vol. 2, 2004.- [35] H. Spath,
Cluster Analysis Algorithms for Data Reduction and Classification of Objects. Ellis Horwood, 1980.- [37] A. Vetta, "Nash Equilibria in Competitive Societies, with Applications to Facility Location, Traffic Routing and Auctions,"
Proc. 43rd Ann. IEEE Symp. Foundations of Computer Science, pp. 416-425, 2002.- [39] A. Zarnani, M. Rahgozar, C. Lucas, and F. Taghiyareh, "Spatial Data Mining for Optimized Selection of Facility Locations in Field-Based Services,"
Proc. IEEE Symp. Computational Intelligence and Data Mining, pp. 734-741, 2007. |