The Community for Technology Leaders
2015 IEEE 31st International Conference on Data Engineering (ICDE) (2015)
Seoul, South Korea
April 13, 2015 to April 17, 2015
ISBN: 978-1-4799-7964-6
pp: 879-890
Lukasz Golab , University of Waterloo, Canada
Flip Korn , Google Research, USA
Feng Li , National University of Singapore, Singapore
Barna Saha , University of Massachusetts, Amherst, USA
Divesh Srivastava , AT&T Labs - Research, USA
In this paper, we introduce a natural generalization of Weighted Set Cover and Maximum Coverage, called Size-Constrained Weighted Set Cover. The input is a collection of n elements, a collection of weighted sets over the elements, a size constraint k, and a minimum coverage fraction ŝ; the output is a sub-collection of up to k sets whose union contains at least ŝn elements and whose sum of weights is minimal. We prove the hardness of approximation of this problem, and we present efficient approximation algorithms with provable quality guarantees that are the best possible. In many applications, the elements are data records, and the set collection to choose from is derived from combinations (patterns) of attribute values. We provide optimization techniques for this special case. Finally, we experimentally demonstrate the effectiveness and efficiency of our solutions.
Approximation methods, Approximation algorithms, Optimization, Polynomials, Heuristic algorithms, Pattern matching, Business

L. Golab, F. Korn, F. Li, B. Saha and D. Srivastava, "Size-Constrained Weighted Set Cover," 2015 IEEE 31st International Conference on Data Engineering (ICDE), Seoul, South Korea, 2015, pp. 879-890.
83 ms
(Ver 3.3 (11022016))