The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.01 - January/February (2009 vol.11)
pp: 36-41
Jennifer Xu , Bentley College
Jinwei Cao , University of Delaware
Porsche Lam , Royal Bank of Scotland
Michael Chau , University of Hong Kong
ABSTRACT
Blogs have become increasingly popular, and new blogs are generated every day. Many of the contents are useful for applications in various domains, such as business, politics, research, social work, and linguistics. However, automatically collecting and analyzing blogs isn't straightforward due to the large size and dynamic nature of the blogosphere. In this article, the authors propose a framework for blog mining that includes spiders, parsers, analyzers, and visualizers. They present several examples of blog mining applications based on their framework.
INDEX TERMS
blogs, blog mining, Web mining, social networks, applications, IT professionals
CITATION
Jennifer Xu, Jinwei Cao, Porsche Lam, Michael Chau, "A Blog Mining Framework", IT Professional, vol.11, no. 1, pp. 36-41, January/February 2009, doi:10.1109/MITP.2009.1
REFERENCES
1. H. Qian and C.R. Scott, "Anonymity and Self-Disclosure on Weblogs," J. Computer-Mediated Comm., vol. 12, no. 4, p. 1.
2. N. Glance et al., "Analyzing Online Discussion for Marketing Intelligence," Proc. 14th Int'l Conf. WWW (WWW 2005), ACM Press, 2005, pp. 1172–1173.
3. A. Qamra, B. Tseng, and E.Y. Chang, "Mining Blog Stories Using Community-Based and Temporal Clustering," Proc. 15th ACM Int'l Conf. Information and Knowledge Management (CIKM 2006), ACM Press, 2006, pp. 58–67.
4. B. Chen et al., "Predicting Blogging Behavior Using Temporal and Social Networks," Proc. 7th IEEE Int'l Conf. Data Mining (ICDM 2007), IEEE CS Press, 2007, pp. 439–444.
5. T. Nanno et al., "Automatically Collecting, Monitoring, and Mining Japanese Weblogs," Proc. 13th Int'l Conf. WWW, (WWW 2004), ACM Press, 2004, 320–321.
6. B. Nardi et al., "Why We Blog," Comm. ACM, vol. 47, no. 12, 2004, pp. 41–46.
7. R. Blood, R., "How Blogging Software Reshapes the Online Community," Comm. ACM, vol. 47, no. 12, 2004, pp. 53–55.
8. R. Kumar et al., "Trawling the Web for Emerging Cybercommunities," Computer Networks, vol. 31, nos. 11–16, 1999, pp. 1481–1493.
9. S. Baker and H. Green . "Blogs Will Change Your Business," Business Week,2 May 2005, pp. 44–53.
10. M. Chau, and H. Chen, "Personalized and Focused Web Spiders," Web Intelligence, eds., N. Zhong, J. Liu, and Y. Yao eds., Springer-Verlag, 2003.
11. D. Shen et al., "Latent Friend Mining from Blog Data," Proc. 6th IEEE Int'l Conf. on Data Mining (ICDM 2006), IEEE CS Press, pp. 552–561.
12. K. Ishida, "Extracting Latent Weblog Communities—A Partitioning Algorithm for Bipartite Graph," , Proc. Ann. Workshop Weblogging Ecosystem: Aggregation, Analysis, and Dynamics, ACM Press, 2005, pp. 1–11.
13. M. Chau and J. Xu, "Mining Communities and Their Relationships in Blogs: A Study of Online Hate Groups," Int'l J. Human-Computer Studies, vol. 65, no. 1, 2007, pp. 57–70.
14. L.C. Freeman, "Centrality in Social Networks: Conceptual Clarification," Social Networks, vol. 1, no. 3, 1979, pp. 215–240.
15. M. Chau and J. Xu, "Studying Customer Groups from Blogs," Proc. 6th WeB 2007, (WEB2007), 2007.
29 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool