A Blog Mining Framework
January/February 2009 (vol. 11 no. 1)
pp. 36-41
Michael Chau, University of Hong Kong
Jennifer Xu, Bentley College
Jinwei Cao, University of Delaware
Porsche Lam, Royal Bank of Scotland
Boby Shiu, University of Hong Kong
Blogs have become increasingly popular, and new blogs are created every day. Much of their content is useful for applications in domains such as business, politics, research, social work, and linguistics. However, automatically collecting and analyzing blogs isn't straightforward because of the blogosphere's large size and dynamic nature. In this article, the authors propose a framework for blog mining that includes spiders, parsers, analyzers, and visualizers. They present several examples of blog mining applications based on their framework.

Index Terms:
blogs, blog mining, Web mining, social networks, applications, IT professionals
Michael Chau, Jennifer Xu, Jinwei Cao, Porsche Lam, Boby Shiu, "A Blog Mining Framework," IT Professional, vol. 11, no. 1, pp. 36-41, Jan.-Feb. 2009, doi:10.1109/MITP.2009.1