2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06) Finding Conceptual Document Clusters with Improved Top-N Formal Concept Search Hong Kong, China December 18-December 22 ISBN: 0-7695-2747-7
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/WI.2006.81
In this paper, we discuss a method for conceptual clustering of documents. Our cluster is defined with the notion of Formal Concept Analysis which can provide a conceptual meaning for each document cluster. Our clustering is formalized as a Top-N \delta-valid formal concept problem. We improve our previous clique search-based algorithm for the problem so that it can be applied to larger scale datasets. For more efficient computation, we present some pruning rules based on theoretical properties of formal concepts. A depth-first branch-and-bound algorithm with the prunings is designed. Our experimental results show valuable clusters can be extracted from a collection of web documents. Moreover, the algorithm outperforms some fast algorithms for mining closed itemsets equivalent to formal concepts.
Citation:
Yoshiaki Okubo, Makoto Haraguchi, "Finding Conceptual Document Clusters with Improved Top-N Formal Concept Search," wi, pp.347-351, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI'06), 2006 Usage of this product signifies your acceptance of the Terms of Use. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||