Issue No.05 - May (2013 vol.25)
pp: 1162-1174
Xuan Li , Institute of Software, Chinese Academy of Sciences, Beijing
Liang Du , Institute of Software, Chinese Academy of Sciences, Beijing
Yi-Dong Shen , Institute of Software, Chinese Academy of Sciences, Beijing
ABSTRACT
Due to the fast evolution of information on the Internet, update summarization has received much attention in recent years. The task is to summarize an evolving document collection as of the current time, assuming that users have already read some related earlier documents. In this paper, we propose a graph-ranking-based method. It performs constrained reinforcements on a sentence graph, which unifies previous and current documents, to determine the salience of the sentences. The constraints ensure that the most salient sentences in the current documents are updates to the previous documents. Since solving this problem exactly is NP-hard, we then propose an approximate method that is solvable in polynomial time. Experiments on the TAC 2008 and 2009 benchmark data sets show the effectiveness and efficiency of our method.
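As a rough illustration of ranking on a sentence graph that unifies previous and current documents, the sketch below runs a plain manifold-ranking iteration, f <- alpha * S f + (1 - alpha) * y, over a toy affinity matrix. It is not the constrained method of the paper: the update constraints and their approximation are not modeled, and the function name manifold_rank, the matrix W, the prior vector y, and the value of alpha are all assumptions made for the example.

```python
import math


def manifold_rank(W, y, alpha=0.85, iters=200):
    """Iterate f <- alpha * S f + (1 - alpha) * y, with S = D^-1/2 W D^-1/2.

    W is a symmetric, non-negative sentence affinity matrix (list of lists)
    and y holds prior relevance scores. Both are hypothetical inputs.
    """
    n = len(W)
    deg = [sum(row) for row in W]
    # Symmetric normalisation; isolated sentences get all-zero rows.
    S = [
        [
            W[i][j] / math.sqrt(deg[i] * deg[j])
            if deg[i] > 0 and deg[j] > 0
            else 0.0
            for j in range(n)
        ]
        for i in range(n)
    ]
    f = list(y)
    for _ in range(iters):
        f = [
            alpha * sum(S[i][j] * f[j] for j in range(n)) + (1 - alpha) * y[i]
            for i in range(n)
        ]
    return f


# Toy graph: sentences 0-1 come from previous documents, 2-3 from current ones.
W = [
    [0.0, 0.6, 0.5, 0.0],
    [0.6, 0.0, 0.1, 0.0],
    [0.5, 0.1, 0.0, 0.3],
    [0.0, 0.0, 0.3, 0.0],
]
y = [0.4, 0.1, 0.3, 0.2]  # made-up topic-relevance priors

scores = manifold_rank(W, y)
current = [2, 3]
# Only current-document sentences are candidates for the update summary.
ranked_current = sorted(current, key=lambda i: scores[i], reverse=True)
print(ranked_current)
```

In this unconstrained form, a current sentence that closely echoes a previous one can still score highly, which is the behavior the constraints described in the abstract are meant to prevent.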
INDEX TERMS
Manifolds, Cost function, Quadratic programming, Equations, Software, Computer science, Summarization, update summarization, topic-focused summarization, multidocument summarization, extraction-based summarization, graph-based ranking, manifold ranking, large-margin constrained ranking, novelty, quadratically constrained quadratic programming
CITATION
Xuan Li, Liang Du, Yi-Dong Shen, "Update Summarization via Graph-Based Sentence Ranking", IEEE Transactions on Knowledge & Data Engineering, vol.25, no. 5, pp. 1162-1174, May 2013, doi:10.1109/TKDE.2012.42