|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology
Informative Polythetic Hierarchical Ephemeral Clustering
Lyon, France
August 22-August 27
ISBN: 978-0-7695-4513-4
| ASCII Text | x | ||
| Gaël Dias, Guillaume Cleuziou, David Machado, "Informative Polythetic Hierarchical Ephemeral Clustering," Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on, vol. 1, pp. 104-111, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2011. | |||
| BibTex | x | ||
| @article{ 10.1109/WI-IAT.2011.123, author = {Gaël Dias and Guillaume Cleuziou and David Machado}, title = {Informative Polythetic Hierarchical Ephemeral Clustering}, journal ={Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on}, volume = {1}, year = {2011}, isbn = {978-0-7695-4513-4}, pages = {104-111}, doi = {http://doi.ieeecomputersociety.org/10.1109/WI-IAT.2011.123}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on TI - Informative Polythetic Hierarchical Ephemeral Clustering SN - 978-0-7695-4513-4 SP104 EP111 A1 - Gaël Dias, A1 - Guillaume Cleuziou, A1 - David Machado, PY - 2011 KW - Hierarchical Ephemeral Clustering KW - Polythetic Web Snippet Representation KW - Informative Similarity Measure KW - Automatic Cluster and Label Evaluation VL - 1 JA - Web Intelligence and Intelligent Agent Technology, IEEE/WIC/ACM International Conference on ER - | |||
Ephemeral clustering has been studied for more than a decade, although with low user acceptance. According to us, this situation is mainly due to (1) an excessive number of generated clusters, which makes browsing difficult and (2) low quality labeling, which introduces imprecision within the search process. In this paper, our motivation is twofold. First, we propose to reduce the number of clusters of Web page results, but keeping all different query meanings. For that purpose, we propose a new polythetic methodology based on an informative similarity measure, the InfoSimba, and a new hierarchical clustering algorithm, the HISGK-means. Second, a theoretical background is proposed to define meaningful cluster labels embedded in the definition of the HISGK-means algorithm, which may elect as best label, words outside the given cluster. To confirm our intuitions, we propose a new evaluation framework, which shows that we are able to extract most of the important query meanings but generating much less clusters than state-of-the-art systems.
Index Terms:
Hierarchical Ephemeral Clustering, Polythetic Web Snippet Representation, Informative Similarity Measure, Automatic Cluster and Label Evaluation
Citation:
Gaël Dias, Guillaume Cleuziou, David Machado, "Informative Polythetic Hierarchical Ephemeral Clustering," wi-iat, vol. 1, pp.104-111, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2011
Usage of this product signifies your acceptance of the Terms of Use.
