Issue No. 10 - Oct. (2013 vol. 25)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2012.198
Morten Middelfart, TARGIT, US and Denmark
Torben Bach Pedersen, Aalborg University, Aalborg
Jan Krogsgaard, TARGIT, US and Denmark
This paper proposes a highly efficient bitmap-based approach for discovery of so-called sentinels. Sentinels represent schema-level relationships between changes over time in certain measures in a multidimensional data cube. Sentinels are actionable and notify users based on previous observations, for example, that revenue might drop within two months if an increase in customer problems combined with a decrease in website traffic is observed. We significantly extend prior art by expressing the sentinel mining problem as bitmap operations, using bitmapped encoding of so-called indication streams. We present a very efficient algorithm, SentBit, that is 2-3 orders of magnitude faster than the state of the art and utilizes the CPU-specific instructions and multicore architectures available on modern processors. The SentBit algorithm scales efficiently to very large data sets, which is verified by extensive experiments on both real and synthetic data.
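To make the bitmap idea concrete, the following is a minimal sketch and not the SentBit algorithm itself. It assumes that an "indication" is a relative period-to-period change of at least a threshold `alpha`, that each measure's positive and negative indications are stored as bits of an integer, and that a candidate relationship is scored by shifting the target bitmap by the warning offset, AND-ing it with the source bitmap, and counting the set bits. The names `indication_bitmaps`, `popcount`, `support`, and the sample data are all hypothetical.

```python
def indication_bitmaps(series, alpha):
    """Return (pos, neg) bitmaps for one measure.

    Bit t is set in pos (neg) when the relative change from series[t]
    to series[t + 1] is >= alpha (<= -alpha). The threshold rule is an
    assumption made for this sketch.
    """
    pos = neg = 0
    for t in range(len(series) - 1):
        prev, cur = series[t], series[t + 1]
        if prev == 0:
            continue  # relative change undefined; skipped in this sketch
        change = (cur - prev) / abs(prev)
        if change >= alpha:
            pos |= 1 << t
        elif change <= -alpha:
            neg |= 1 << t
    return pos, neg


def popcount(bits):
    """Count set bits; a hardware population-count instruction would do this natively."""
    return bin(bits).count("1")


def support(source_bits, target_bits, offset):
    """Count periods t with a source indication at t and a target indication at t + offset."""
    return popcount(source_bits & (target_bits >> offset))


# Hypothetical monthly data: customer problems and revenue.
problems = [100, 120, 121, 150, 151, 180]
revenue = [50, 50, 40, 40, 30, 30, 30]

prob_pos, prob_neg = indication_bitmaps(problems, alpha=0.1)
rev_pos, rev_neg = indication_bitmaps(revenue, alpha=0.1)

# How often is a rise in problems followed one period later by a revenue drop?
followed = support(prob_pos, rev_neg, offset=1)  # -> 2
total = popcount(prob_pos)                       # -> 3
```

In this toy data, two of the three increases in customer problems are followed by a revenue decrease one period later. The point of the encoding is that each candidate relationship costs one shift, one AND, and one bit count, regardless of how the indications were derived.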
Data mining, Time measurement, Bidirectional control, Art, Encoding, Organizations, Databases, cube-based data mining, sentinels, Pattern mining, predictive data mining
M. Middelfart, T. B. Pedersen and J. Krogsgaard, "Efficient Sentinel Mining Using Bitmaps on Modern Processors," in IEEE Transactions on Knowledge & Data Engineering, vol. 25, no. 10, pp. 2231-2244, 2013.