This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
A Statistical Model for User Preference
June 2005 (vol. 17 no. 6)
pp. 834-843
Modeling user preference is one of the challenging issues in intelligent information systems. Extensive research has been performed to automatically analyze user preference and to utilize it. One problem still remains: The representation of preference, usually given by measure of vector similarity or probability, does not always correspond to common sense of preference. This problem gets worse in the case of negative preference. To overcome this problem, this paper presents a preference model using mutual information in a statistical framework. This paper also presents a method that combines information of joint features and alleviates problems arising from sparse data. Experimental results, compared with the previous recommendation models, show that the proposed model has the highest accuracy in recommendation tests.

[1] L.A. Adamic, “Zipf, Power-Laws, and Pareto— A Ranking Tutorial, Internet Ecologies Area,” http://www.hpl.hp.com/shl/papers/ranking ranking.html, 2000.
[2] G. Adomavicius and A. Tuzhilin, “Using Data Mining Methods to Build Customer Profiles,” Computer, pp. 74-82, 2001.
[3] R. Agrawal et al., “Mining Association Rules between Sets of Items in Large Databases,” Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 207-216, 1993.
[4] D. Billsus and M.J. Pazzani, “Learning Collaborative Information Filters,” Proc. Machine Learning Conf., 1998.
[5] D. Billsus and M. Pazzani, “A Hybrid User Model for News Story Classification,” Proc. Seventh Int'l Conf. User Modeling, June 1999.
[6] J.S. Breese et al., “Empirical Analysis of Predictive Algorithms for Collaborative Filtering,” Proc. Fourth Conf. Uncertainty in Artificial Intelligence, 1998.
[7] W.B. Frakes and R. Baeza-Yates, Information Retrieval: Data Structures & Algorithms. Prentice-Hall, 1992.
[8] S.A. Goldman and M.K. Warmuth, “Learning Binary Relations Using Weighted Majority Voting,” Proc. ACM Conf. Computational Learning Theory, pp. 453-462, 1993.
[9] T. Joachims, “A Probabilistic Analysis of the Rocchio Algorithm with TF-IDF for Text Categorization,” Proc. Int'l Conf. Machine Learning, pp. 143-151, 1997.
[10] T. Joachims, “Text Categorization with Support Vector Machines: Learning with Many Relevant Features,” Proc. European Conf. Machine Learning, pp. 137-142, 1998.
[11] S.Y. Jung et al., “MRF-Based English Part-of-Speech Tagging System,” Proc. 16th Int'l Conf. Computational Linguistics, 1996.
[12] W. Li, “Random Texts Exhibit Zipf's-law-like Word Frequency Distribution,” IEEE Trans. Information Theory, vol. 38, no. 6, pp. 1842-1845, 1992.
[13] A. Nakamura and N. Abe, “Collaborative Filtering Using Weighted Majority Prediction Algorithms,” Proc. Int'l Conf. Machine Learning, pp. 395-403, 1998.
[14] P. Resnick et al., “GroupLens: An Open Architecture for Collaborative Filtering of Netnews,” Internal Research Report, MIT Center for Coordination Science, 1994.
[15] W.J. Reed, “The Pareto, Zipf and Other Power Laws,” Economics Letters, 2000.
[16] S. Russel and P. Norvig, “Making Simple Decisions,” Artificial Intelligence— A Modern Approach, pp. 471-497, Prentice Hall, 1995.
[17] I. Schwab and W. Pohl, “Learning User Profiles from Positive Examples,” Proc. Int'l Conf. Machine Learning & Applications, pp. 15-20, 1999.
[18] C. Silverstein et al., “Beyond Market Baskets: Generalizing Association Rules to Dependence Rules,” Data Mining and Knowledge Discovery, vol. 2, no. 1, pp. 39-68, 1998.
[19] R. Srikant et al., “Mining Generalized Association Rules,” VLDB J., 1995.
[20] T. Zhang, “Text Categorization Based on Regularized Linear Classification Methods,” Information Retrieval, vol. 4, no. 1, pp. 5-31, 2001.

Index Terms:
Personalization, user preference, recommendation, user modeling, text mining, mutual information, data sparseness, feature-combining weight, Pareto distribution.
Citation:
Sung Young Jung, Jeong-Hee Hong, Taek-Soo Kim, "A Statistical Model for User Preference," IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 6, pp. 834-843, June 2005, doi:10.1109/TKDE.2005.86
Usage of this product signifies your acceptance of the Terms of Use.