The Community for Technology Leaders
Green Image
Issue No. 10 - Oct. (2013 vol. 25)
ISSN: 1041-4347
pp: 2231-2244
Jan Krogsgaard , TARGIT, US and Denmark
Torben Bach Pedersen , Aalborg University, Aalborg
Morten Middelfart , TARGIT, US and Denmark
ABSTRACT
This paper proposes a highly efficient bitmap-based approach for discovery of so-called sentinels. Sentinels represent schema level relationships between changes over time in certain measures in a multidimensional data cube. Sentinels are actionable and notify users based on previous observations, for example, that revenue might drop within two months if an increase in customer problems combined with a decrease in website traffic is observed. We significantly extend prior art by representing the sentinel mining problem by bitmap operations, using bitmapped encoding of so-called indication streams. We present a very efficient algorithm, SentBit, that is 2-3 orders of magnitude faster than the state of the art, and utilizes CPU specific instructions and the multicore architectures available on modern processors. The SentBit algorithm scales efficiently to very large data sets, which is verified by extensive experiments on both real and synthetic data.
INDEX TERMS
Data mining, Time measurement, Bidirectional control, Art, Encoding, Organizations, Databases, cube-based data mining, Data mining, Time measurement, Bidirectional control, Art, Encoding, Organizations, Databases, sentinels, Pattern mining, predictive data mining
CITATION
Jan Krogsgaard, Torben Bach Pedersen, Morten Middelfart, "Efficient Sentinel Mining Using Bitmaps on Modern Processors", IEEE Transactions on Knowledge & Data Engineering, vol. 25, no. , pp. 2231-2244, Oct. 2013, doi:10.1109/TKDE.2012.198
93 ms
(Ver )