Issue No. 11 - Nov. 2013 (vol. 19)
pp. 1833-1845
S. Lafon, Comput. Sci. Lab., Univ. Francois-Rabelais of Tours, Tours, France
F. Bouali, Comput. Sci. Lab., Univ. Francois-Rabelais of Tours, Tours, France
C. Guinot, Comput. Sci. Lab., Univ. Francois-Rabelais of Tours, Tours, France
G. Venturini, Comput. Sci. Lab., Univ. Francois-Rabelais of Tours, Tours, France
In this paper, we propose a new method for visually reorganizing online analytical processing (OLAP) cubes in order to improve their visualization. Our method addresses dimensions whose members are hierarchically organized. It uses a genetic algorithm that reorganizes k-ary trees: genetic operators permute subtrees to optimize a visual homogeneity function. We propose several ways to reorganize an OLAP cube, depending on which set of members is selected for reorganization: all of the members, only the displayed members, or the members at a given level (a level-by-level approach). Results evaluated with optimization criteria show that our algorithm performs reliably even when limited to 1-minute runs. The algorithm was integrated into an interactive 3D interface for OLAP, and a user study was conducted to evaluate the approach. The results highlight the usefulness of reorganization in two OLAP tasks.
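The idea of permuting subtrees of a k-ary member hierarchy to make the displayed order more homogeneous can be illustrated with a small sketch. It is not the paper's algorithm: it is a simplified, mutation-only evolutionary loop with elitist selection, and every name in it (`roughness`, `mutate`, `reorganize`) is hypothetical. The cost function, the sum of absolute differences between adjacent leaf values, is an assumed stand-in for the visual homogeneity function, which the abstract does not define.

```python
import random

# Assumption: a hierarchy node is either a leaf (a number, e.g. an aggregated
# measure for one member) or a list of child nodes.

def leaves(node):
    """Return leaf values in left-to-right display order."""
    if not isinstance(node, list):
        return [node]
    out = []
    for child in node:
        out.extend(leaves(child))
    return out

def roughness(node):
    """Assumed cost: sum of absolute differences between adjacent leaves.
    Lower means a more homogeneous display order."""
    vals = leaves(node)
    return sum(abs(a - b) for a, b in zip(vals, vals[1:]))

def internal_nodes(node):
    """Collect every internal node (list) of the tree."""
    if not isinstance(node, list):
        return []
    found = [node]
    for child in node:
        found.extend(internal_nodes(child))
    return found

def copy_tree(node):
    """Deep-copy the nested-list tree."""
    return [copy_tree(c) for c in node] if isinstance(node, list) else node

def mutate(tree, rng):
    """Return a copy in which two sibling subtrees of one random parent are
    swapped. The hierarchy itself (who is a child of whom) is unchanged."""
    new = copy_tree(tree)
    candidates = [n for n in internal_nodes(new) if len(n) >= 2]
    if candidates:
        parent = rng.choice(candidates)
        i, j = rng.sample(range(len(parent)), 2)
        parent[i], parent[j] = parent[j], parent[i]
    return new

def reorganize(tree, generations=200, pop_size=20, seed=0):
    """Mutation-only evolutionary search with elitist truncation selection."""
    rng = random.Random(seed)
    population = [copy_tree(tree)]
    for _ in range(generations):
        offspring = [mutate(rng.choice(population), rng) for _ in range(pop_size)]
        # Parents compete with offspring, so the best cost never gets worse.
        population = sorted(population + offspring, key=roughness)[:pop_size]
    return population[0]

if __name__ == "__main__":
    cube_dimension = [[9, 1, 8], [2, 7], [3]]
    best = reorganize(cube_dimension)
    print(roughness(cube_dimension), "->", roughness(best), best)
```

Because only sibling order changes, each parent keeps exactly the same set of children, which mirrors the constraint that a reorganized dimension must still respect its member hierarchy. A fuller genetic algorithm would add a crossover operator; that is omitted here to keep the sketch short.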
Visualization, Genetic algorithms, Three-dimensional displays, Data visualization, Genetics, Sociology, Statistics, Interactive knowledge discovery, Dimension reorganization, Visual OLAP
S. Lafon, F. Bouali, C. Guinot, G. Venturini, "Hierarchical Reorganization of Dimensions in OLAP Visualizations", IEEE Transactions on Visualization & Computer Graphics, vol.19, no. 11, pp. 1833-1845, Nov. 2013, doi:10.1109/TVCG.2013.93