Issue No.04 - Oct.-Dec. (2013 vol.6)
pp: 484-496
Jianmin Wang, Tsinghua University, Beijing
Raymond K. Wong, University of New South Wales, Sydney
Jianwei Ding, Tsinghua University, Beijing
Qinlong Guo, Tsinghua University, Beijing
Lijie Wen, Tsinghua University, Beijing
ABSTRACT
While many process mining algorithms have been proposed recently, there does not exist a widely accepted benchmark to evaluate and compare these process mining algorithms. As a result, it can be difficult to choose a suitable process mining algorithm for a given enterprise or application domain. Some recent benchmark systems have been developed and proposed to address this issue. However, evaluating available process mining algorithms against a large set of business models (e.g., in a large enterprise) can be computationally expensive, tedious, and time-consuming. This paper investigates a scalable solution that can evaluate, compare, and rank these process mining algorithms efficiently, and hence proposes a novel framework that can efficiently select the process mining algorithms that are most suitable for a given model set. In particular, using our framework, only a portion of process models need empirical evaluation and others can be recommended directly via a regression model. As a further optimization, this paper also proposes a metric and technique to select high-quality reference models to derive an effective regression model. Experiments using artificial and real data sets show that our approach is practical and outperforms the traditional approach.
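The recommendation step described in the abstract can be illustrated with a minimal sketch. It assumes each process model is summarized by a small numeric feature vector and that each mining algorithm has been scored empirically on a handful of reference models. One regression is then fitted per algorithm, and the algorithm with the highest predicted score is recommended for an unseen model. The feature values, the scores, the algorithm names (`algo_A`, `algo_B`), and the use of ordinary least squares are all illustrative assumptions; the abstract does not specify the features or the form of the regression model.

```python
from typing import Dict, List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a*x = b by Gauss-Jordan elimination with partial pivoting.

    Assumes the matrix is non-singular (no check is performed).
    """
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_least_squares(features: Sequence[Sequence[float]],
                      scores: Sequence[float]) -> List[float]:
    """Fit weights (intercept first) via the normal equations X^T X w = X^T y."""
    x = [[1.0] + list(f) for f in features]
    k = len(x[0])
    xtx = [[sum(row[i] * row[j] for row in x) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * y for row, y in zip(x, scores)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def predict(weights: Sequence[float], feature: Sequence[float]) -> float:
    """Predicted quality score of one algorithm on a model with these features."""
    return weights[0] + sum(w * f for w, f in zip(weights[1:], feature))


def recommend(fitted: Dict[str, List[float]], feature: Sequence[float]) -> str:
    """Return the algorithm whose regression predicts the highest score."""
    return max(fitted, key=lambda name: predict(fitted[name], feature))


# Hypothetical reference models: two made-up structural features per model.
reference_features = [(2.0, 1.0), (4.0, 0.0), (6.0, 3.0), (8.0, 2.0)]

# Hypothetical empirical scores of each algorithm on the reference models
# (toy numbers for illustration, not results from the paper).
reference_scores = {
    "algo_A": [0.80, 0.70, 0.60, 0.50],
    "algo_B": [0.54, 0.58, 0.62, 0.66],
}

# Only the reference models were evaluated empirically; fit one regression each.
fitted = {name: fit_least_squares(reference_features, ys)
          for name, ys in reference_scores.items()}

# Recommend an algorithm for unseen models without running any mining.
small_model = (3.0, 1.0)
large_model = (10.0, 1.0)
print(recommend(fitted, small_model), recommend(fitted, large_model))
```

In this toy setup the cost of empirical evaluation is paid only for the four reference models; every further model needs just a feature extraction and a few multiplications. How well such a shortcut works depends on how representative the reference models are, which is the concern the abstract addresses with its reference-model selection metric.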
INDEX TERMS
Computational modeling, Benchmark testing, Feature extraction, Training, Heuristic algorithms, Organizations, Benchmark, Business process mining, Evaluation
CITATION
Jianmin Wang, Raymond K. Wong, Jianwei Ding, Qinlong Guo, Lijie Wen, "Efficient Selection of Process Mining Algorithms," IEEE Transactions on Services Computing, vol. 6, no. 4, pp. 484-496, Oct.-Dec. 2013, doi:10.1109/TSC.2012.20