The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.02 - March/April (2010 vol.14)
pp: 24-32
Geetika T. Lakshmanan , IBM T. J. Watson Research Center
Martin A. Oberhofer , IBM Software Group, Germany
ABSTRACT
Knowledge discovery in blogs is different from knowledge discovery in areas such as databases or Web documents due to blogs' unique characteristics, which introduce additional mining challenges. Although researchers have investigated several techniques to address different aspects of blog discovery, no comparisons among key knowledge discovery techniques for blogs exist. This article examines three prominent techniques that are frequently applied to discovery in blogs — clustering, matrix decomposition, and ranking. The authors compare them in terms of effectiveness in combating present challenges and their ability to accomplish challenging tasks required for effective blog mining.
INDEX TERMS
knowledge retrieval, knowledge management, artificial intelligence, computing methodologies, clustering, information search and retrieval, information storage and retrieval, information technology and systems, interactive data exploration and discovery, database applications, database management, mining methods and algorithms, blogs, trend analysis, topic detection, collective wisdom
CITATION
Geetika T. Lakshmanan, Martin A. Oberhofer, "Knowledge Discovery in the Blogosphere: Approaches and Challenges", IEEE Internet Computing, vol.14, no. 2, pp. 24-32, March/April 2010, doi:10.1109/MIC.2010.26
REFERENCES
1. N. Agarwal and H. Liu, "Blogosphere: Research Issues, Tools, and Applications," ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD) Explorations, vol. 10, no. 1, 2008, pp. 18–31.
2. Y-R. Lin et al., "Blog Community Discovery and Evolution Based on Mutual Awareness Expansion," Proc. Conf. Web Intelligence, ACM Press, 2007, pp. 48–56.
3. K.C. Sia et al., "Efficient Computation of Personal Aggregate Queries on Blogs," Proc. Knowledge Discovery and Data Mining Conf., ACM Press, 2008, pp. 632–640.
4. D. Gruhl et al., "The Predictive Power of Online Chatter," Proc. Int'l Conf. Knowledge Discovery and Data Mining, ACM Press, 2005, pp. 78–87.
5. X. Song et al., "Identifying Opinion Leaders in the Blogosphere," Proc. Conf. Information and Knowledge Management, ACM Press, 2007, pp. 971–974.
6. Y. Chi et al., "Structural and Temporal Analysis of the Blogosphere through Community Factorization," Proc. Knowledge Discovery and Data Mining Conf., ACM Press, 2007, pp. 163–172.
7. N. Bansal et al., "Seeking Stable Clusters in the Blogosphere," Proc. Very Large Databases Conf., ACM Press, 2007, pp. 806–817.
8. A. Qamra, B. Tseng, and E.Y. Chang, "Mining Blog Stories Using Community-Based and Temporal Clustering," Proc. Conf. Information and Knowledge Management, ACM Press, 2006, pp. 58–67.
9. N. Agarwal et al., "Identifying the Influential Bloggers in a Community," Proc. Web Search and Data Mining Conf., ACM Press, 2008, pp. 207–217.
10. A. Stewart et al., "Discovering Information Diffusion Paths from Blogosphere for Online Advertising," Proc. Workshop Data Mining and Audience Intelligence for Advertising in Conjunction with Knowledge Discovery and Data Mining, ACM Press, 2007, pp. 46–54.
11. N. Agarwal et al., "Clustering Blogs with Collective Wisdom," Proc. Int'l Conf. Web Engineering, IEEE CS Press, 2008, pp. 14–18.
12. X. Li, L. Guo, and Y. Zhao, "Tag-Based Social Interest Discovery," Proc. World Wide Web Conf., ACM Press, 2008, pp. 675–684.
13. I.S. Dhillon, S. Mallela, and D.S. Modha, "Information Theoretic Coclustering," Proc. Int'l Conf. Knowledge Discovery and Data Mining, ACM Press, 2003, pp. 89–98.
14. Y. Chi, B.L. Tseng, and J. Tatemura, "Eigen-Trend: Trend Analysis in the Blogosphere Based on Singular Value Decompositions," Proc. Conf. Information and Knowledge Management, ACM Press, 2006, pp. 68–77.
15. Y. Chen, F.S. Tsai, and K.L. Chan, "Blog Search and Mining in the Business Domain," Proc. Workshop on Domain Driven Data Mining in Conjunction with Knowledge Discovery and Data Mining, ACM Press, 2007, pp. 55–60.
16. K.C. Sia et al., "Efficient Computation of Personal Aggregate Queries on Blogs," Proc. Knowledge Discovery and Data Mining, ACM Press, 2008, pp. 632–640.
17. H. Tong et al., "Colibri: Fast Mining of Large Static and Dynamic Graphs," Proc. Conf. Knowledge Discovery and Data Mining, ACM Press, 2008, pp. 686–694.
18. A. Hassan et al., "Content-Based Recommendation and Summarization in the Blogosphere," Proc. Conf. Weblogs and Social Media, ACM Press, 2009; www.aaai.org/ocs/index.php/ICWSM/09/paper/ view203.
19. W. Zhang, C. Yu, and W. Meng, "Opinion Retrieval from Blogs," Proc. Conf. Information and Knowledge Management, ACM Press, 2007, pp. 831–840.
20. R. Kumar et al., "Efficient Discovery of Authoritative Resources," Proc. Int'l Conf. Data Engineering, ACM Press, 2008, pp. 1495–1497.
18 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool