
This Article  
 
Share  
Bibliographic References  
Add to:  
Digg Furl Spurl Blink Simpy Del.icio.us Y!MyWeb  
Search  
 
ASCII Text  x  
Jun Yan, Ning Liu, Shuicheng Yan, Qiang Yang, Weiguo (Patrick) Fan, Wei Wei, Zheng Chen, "TraceOriented Feature Analysis for LargeScale Text Data Dimension Reduction," IEEE Transactions on Knowledge and Data Engineering, vol. 23, no. 7, pp. 11031117, July, 2011.  
BibTex  x  
@article{ 10.1109/TKDE.2010.34, author = {Jun Yan and Ning Liu and Shuicheng Yan and Qiang Yang and Weiguo (Patrick) Fan and Wei Wei and Zheng Chen}, title = {TraceOriented Feature Analysis for LargeScale Text Data Dimension Reduction}, journal ={IEEE Transactions on Knowledge and Data Engineering}, volume = {23}, number = {7}, issn = {10414347}, year = {2011}, pages = {11031117}, doi = {http://doi.ieeecomputersociety.org/10.1109/TKDE.2010.34}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, }  
RefWorks Procite/RefMan/Endnote  x  
TY  JOUR JO  IEEE Transactions on Knowledge and Data Engineering TI  TraceOriented Feature Analysis for LargeScale Text Data Dimension Reduction IS  7 SN  10414347 SP1103 EP1117 EPD  11031117 A1  Jun Yan, A1  Ning Liu, A1  Shuicheng Yan, A1  Qiang Yang, A1  Weiguo (Patrick) Fan, A1  Wei Wei, A1  Zheng Chen, PY  2011 KW  Algebraic algorithms KW  computations on matrices KW  document analysis KW  global optimization. VL  23 JA  IEEE Transactions on Knowledge and Data Engineering ER   
[1] N.J. Belkin and W.B. Croft, "Retrieval Techniques," Ann. Rev. of Information Science and Technology, vol. 22, pp. 109145, 1987.
[2] S. Deerwester, S.T. Dumais, G.W. Furnas, T.K. Landauer, and R. Harshman, "Indexing by Latent Semantic Analysis," J. Am. Soc. for Information Science, vol. 41, pp. 391407, 1990.
[3] S.T. Dumais, "LSI Meets TREC: A Status Report," Proc. First Text Retrieval Conf. (TREC), pp. 137152, 1992.
[4] S.T. Dumais, "Using LSI for Information Filtering: TREC3 Experiments," Proc. Third Text Retrieval Conf. (TREC), 1995.
[5] S.T. Dumais, "Combining Evidence for Effective Information Filtering," Proc. AAAI Spring Symp. Machine Learning and Information Retrieval, pp. 2630, 1996.
[6] S.T. Dumais, "Latent Semantic Indexing (LSI) and TREC2," Proc. Second Text Retrieval Conf. (TREC), pp. 105116, 1993.
[7] S.T. Dumais, G.W. Furnas, T.K. Landauer, and S. Deerwester, "Using Latent Semantic Analysis to Improve Information Retrieval," Proc. ACM Conf. Human Factors in Computing Systems (CHI '88), pp. 281285, 1988.
[8] S.T. Dumais and J. Nielsen, "Automating the Assignment of Submitted Manuscripts to Reviewers," Proc. ACM SIGIR '92, pp. 233244, 1992.
[9] R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification, second ed., Wiley, 2000.
[10] F. George, "An Extensive Empirical Study of Feature Selection Metrics for Text Classification," J. Machine Learning Research, vol. 3, pp. 12891305, 2003.
[11] E. Greengrass, "Information Retrieval: A Survey," Technical Report CSTR3514, Univ. of Maryland, 2000.
[12] X. He, "Locality Preserving Projections," PhD thesis, Computer Science Dept, the Univ. of Chicago, 2005.
[13] X. He, "Incremental SemiSupervised Subspace Learning for Image Retrieval," Proc. 12th Ann. ACM Int'l Conf. Multimedia, pp. 28, 2004.
[14] K. Hiraoka and M. Hamahira, "On Successive Learning Type Algorithm for Linear Discriminant Analysis," IEIC Technical Report, (in Japanese), vol. 99, pp. 8592, 1999.
[15] P. Howland and H. Park, "Generalizing Discriminant Analysis Using the Generalized Singular Value Decomposition," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 26, no. 8, pp. 9951006, Aug. 2004.
[16] M. Jeon, H. Park, and J.B. Rosen, "Dimension Reduction Based on Centroids and Least Squares for Efficient Processing of Text Data," Technical Report 01010, Univ. of Minnesota, 2001.
[17] I.T. Jolliffe, Principal Component Analysis. SprigerVerlag, 1986.
[18] D.D. Lewis, "Feature Selection and Feature Extraction for Text Categorization," Proc. Speech and Natural Language Workshop, pp. 212217, 1992.
[19] D. Lewis, Y. Yang, T. Rose, and F. Li, "RCV1: A New Benchmark Collection for Text Categorization Research," J. Machine Learning Research, vol. 5, pp. 361397, 2004.
[20] H. Li, T. Jiang, and K. Zhang, "Efficient and Robust Feature Extraction by Maximum Margin Criterion," Proc. Advances in Neural Information Processing Systems, pp. 97104, 2003.
[21] M.L. Littman, S.T. Dumais, and T.K. Landauer, "Automatic CrossLinguistic Information Retrieval Using Latent Semantic Indexing," Proc. Cross Language Information Retrieval, 1997.
[22] T. Liu, S. Liu, Z. Chen, and W.Y. Ma, "An Evaluation on Feature Selection for Text Clustering," Proc. 20th Int'l Conf. Machine Learning, pp. 484495, 2003.
[23] A.M. Martinez and A.C. Kak, "PCA versus LDA," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 2, pp. 228233, Feb. 2001.
[24] S.T. Roweis and L.K. Saul, "Nonlinear Dimensionality Reduction by Locally Linear Embedding," Science, vol. 290, pp. 23232326, 2000.
[25] B. Tang, X. Luo, M.I. Heywood, and M. Shepherd, "A Comparative Study of Dimension Reduction Techniques for Document Clustering," Technical Report CS200414, Faculty of Computer Science, Dalhousie Univ., 2004.
[26] C. Tang, Z. Xu, and S. Dwarkadas, "PeertoPeer Information Retrieval Using SelfOrganizing Semantic Overlay Networks," Proc. ACM SIGCOMM '03, pp. 175186, 2003.
[27] C. Tang, Z. Xu, and S. Dwarkadas, "On Scaling Latent Semantic Indexing for Large PeertoPeer Systems," Proc. ACM SIGIR '04, pp. 112 121, 2004.
[28] J.B. Tenenbaum, V. de Silva, and J.C. Langford, "A Global Geometric Framework for Nonlinear Dimensionality Reduction," Science, vol. 290, pp. 23192323, 2009.
[29] J. Thorsten, "Making LargeScale SVM Learning Practical," Advances in Kernel Methods: Support Vector Learning, B. Schölkopf, C. Burges, and A. Smola, eds., MIT Press, 1999.
[30] M.E. Wall, R. Andreas, and M.R. Luis, "Singular Value Decomposition and Principal Component Analysis," A Practical Approach to Microarray Data Analysis, pp. 91109, Kluwer Academic Publishers, 2003.
[31] J. Weng, Y. Zhang, and W.S. Hwang, "Candid CovarianceFree Incremental Principal Component Analysis," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 25, no. 8, pp. 10341040, Aug. 2003.
[32] J. Yan, Zhang, B.S. Yan, Z. Chen, W. Fan, Q. Yang, W.Y. Ma, and Q. Cheng, "IMMC: Incremental Maximum, Marginal Criterion," Proc. 10th ACM SIGKDD, pp. 725730, 2004.
[33] J. Yan, B. Zhang, N. Liu, S. Yan, Q. Cheng, W. Fan, Q. Yang, W. Xi, and Z. Chen, "Effective and Efficient Dimensionality Reduction for LargeScale and Streaming Data Preprocessing," IEEE Trans. Knowledge and Data Eng., vol. 18, no. 3, pp. 320333, Mar. 2006.
[34] J. Yan, N. Liu, B. Zhang, S. Yan, Z. Chen, Q. Cheng, W. Fan, and W.Y. Ma, "OCFS: Optimal Orthogonal Centroid Feature Selection for Text Categorization," Proc. ACM SIGIR, pp. 122129, 2005.
[35] J. Yan, N. Liu, B. Zhang, S. Yan, and Z. Chen, "A Novel Scalable Algorithm for Supervised Subspace Learning," Proc. Sixth IEEE Int'l Conf. Data Mining, pp. 721730, 2007.
[36] S. Yan, D. Xu, B. Zhang, H. Zhang, Q. Yang, and S. Lin, "Graph Embedding and Extension: A General Framework for Dimensionality Reduction," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 29, no. 1, pp. 4051, Jan. 2007.
[37] Y. Yang and X. Liu, "A ReExamination of Text Categorization Methods," Proc. ACM SIGIR, pp. 4249, 1999.
[38] Y. Yang, "Noise Reduction in a Statistical Approach to Text Categorization," Proc. ACM SIGIR '95, pp. 256263, 1995.
[39] Y. Yang and J.O. Pedersen, "A Comparative Study on Feature Selection in Text Categorization," Proc. 14th Int'l Conf. Machine Learning, pp. 412420, 1997.
[40] O. Zamir and O.G. Etzioni, "A Dynamic Clustering Interface to Web Search Results," Proc. Eighth Int'l World Wide Web Conf. (WWW8), May 1999.
[41] LSA @ CU Boulder, http:/lsa.colorado.edu/, 2010.
[42] F. Cozman, I. Cohen, and M. Cirelo, "SemiSupervised Learning of Mixture Models," Proc. 20th Int'l Conf. Machine Learning, pp. 99106, 2003.
[43] D. Crabtree, G. Xiaoying, and P. Andreae, "Standardized Evaluation Method for Web Clustering Results 2005," Proc. 2005 IEEE/WIC/ACM Int'l Conf. Web Intelligence, pp. 280283, 2005.