The Community for Technology Leaders
2013 IEEE 29th International Conference on Data Engineering (ICDE) (2002)
San Jose, California
Feb. 26, 2002 to Mar. 1, 2002
ISBN: 0-7695-1531-2
pp: 0685
Sudipto Guha , University of Pennsylvania
Liadan O'Callaghan , Stanford University
Rajeev Motwani , Stanford University
Nina Mishra , Hewlett Packard Laboratories
Adam Meyerson , Stanford University
ABSTRACT
Streaming data analysis has recently attracted attention in numerous applications including telephone records, web documents and clickstreams. For such analysis, single-pass algorithms that consume a small amount of memory are critical. We describe such a streaming algorithm that effectively clusters large data streams. We also provide empirical evidence of the algorithm's performance on synthetic and real data streams.
INDEX TERMS
CITATION
Sudipto Guha, Liadan O'Callaghan, Rajeev Motwani, Nina Mishra, Adam Meyerson, "Streaming-Data Algorithms for High-Quality Clustering", 2013 IEEE 29th International Conference on Data Engineering (ICDE), vol. 00, no. , pp. 0685, 2002, doi:10.1109/ICDE.2002.994785
104 ms
(Ver 3.3 (11022016))