This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Second IEEE International Conference on Data Mining (ICDM'02)
Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing
Maebashi City, Japan
December 09-December 12
ISBN: 0-7695-1754-4
Naoki Abe, I.B.M. T. J. Watson Research Center
Edwin Pednault, I.B.M. T. J. Watson Research Center
Haixun Wang, I.B.M. T. J. Watson Research Center
Bianca Zadrozny, I.B.M. T. J. Watson Research Center
Wei Fan, I.B.M. T. J. Watson Research Center
Chid Apte, I.B.M. T. J. Watson Research Center
We empirically evaluate the performance of various re-inforcement learning methods in applications to sequential targeted marketing. In particular, we propose and evaluate a progression of reinforcement learning methods, ranging from the "direct" or "batch" methods to "indirect" or "simulation based" methods, and those that we call "semi-direct" methods that fall between them. We conduct a num-ber of controlled experiments to evaluate the performance of these competing methods. Our results indicate that while the indirect methods can perform better in a situation in which nearly perfect modeling is possible, under the more realistic situations in which the system's modeling parameters have restricted attention, the indirect methods' performance tend to degrade. We also show that semi-direct methods are effective in reducing the amount of computation necessary to attain a given level of performance, and often result in more profitable policies.
Citation:
Naoki Abe, Edwin Pednault, Haixun Wang, Bianca Zadrozny, Wei Fan, Chid Apte, "Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing," icdm, pp.3, Second IEEE International Conference on Data Mining (ICDM'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.