9th International Database Engineering & Application Symposium (IDEAS'05) Distribution-Based Synthetic Database Generation Techniques for Itemset Mining Montreal, Canada July 25-July 27 ISBN: 0-7695-2404-4
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/IDEAS.2005.22
The resource requirements of frequent pattern mining algorithms depend mainly on the length distribution of the mined patterns in the database. Synthetic databases, which are used to benchmark performance of algorithms, tend to have distributions far different from those observed in real datasets. In this paper we focus on the problem of synthetic database generation and propose algorithms to effectively embed within the database, any given set of maximal pattern collections, and make the following contributions:
Citation:
Ganesh Ramesh, Mohammed J. Zaki, William A. Maniatty, "Distribution-Based Synthetic Database Generation Techniques for Itemset Mining," ideas, pp.307-316, 9th International Database Engineering & Application Symposium (IDEAS'05), 2005 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||