loading...
 This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fifth Mexican International Conference in Computer Science (ENC'04)
Efficient Data Structures and Parallel Algorithms for Association Rules Discovery
Colima, M?xico
September 20-September 24
ISBN: 0-7695-2160-6
Christophe C?rin, Universit? de Picardie Jules Verne
Jean-S?batien Gay, Universit? de Picardie Jules Verne
Ga? Le Mahec, Universit? de Picardie Jules Verne
Michel Koskas, Universit? de Picardie Jules Verne
Discovering patterns or frequent episodes in transactions is an important problem in data-mining for the purpose of infering deductive rules from them. Because of the huge size of the data to deal with, parallel algorithms have been designed for reducing both the execution time and the number of repeated passes over the database in order to reduce, as much as possible, I/O overheads. In this paper, we introduce new approaches for the implementation of two basic algorithms for association rules discovery (namely Apriori and Eclat). Our approaches combine efficient data structures to code different key information (line indexes, candidates) and we exhibit how to introduce parallelism for processing such data-structures.
Index Terms:
Datamining, Association rules discovery, Radix Trees and bit vectors, Apriori, Eclat and Count Distribution algorithms
Citation:
Christophe C?rin, Jean-S?batien Gay, Ga? Le Mahec, Michel Koskas, "Efficient Data Structures and Parallel Algorithms for Association Rules Discovery," enc, pp.399-406, Fifth Mexican International Conference in Computer Science (ENC'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.