|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Second IEEE International Conference on Data Mining (ICDM'02)
Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing
Maebashi City, Japan
December 09-December 12
ISBN: 0-7695-1754-4
| ASCII Text | x | ||
| Naoki Abe, Edwin Pednault, Haixun Wang, Bianca Zadrozny, Wei Fan, Chid Apte, "Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing," Data Mining, IEEE International Conference on, pp. 3, Second IEEE International Conference on Data Mining (ICDM'02), 2002. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDM.2002.1183879, author = {Naoki Abe and Edwin Pednault and Haixun Wang and Bianca Zadrozny and Wei Fan and Chid Apte}, title = {Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing}, journal ={Data Mining, IEEE International Conference on}, volume = {0}, year = {2002}, isbn = {0-7695-1754-4}, pages = {3}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDM.2002.1183879}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Data Mining, IEEE International Conference on TI - Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing SN - 0-7695-1754-4 SP EP A1 - Naoki Abe, A1 - Edwin Pednault, A1 - Haixun Wang, A1 - Bianca Zadrozny, A1 - Wei Fan, A1 - Chid Apte, PY - 2002 KW - null VL - 0 JA - Data Mining, IEEE International Conference on ER - | |||
We empirically evaluate the performance of various re-inforcement learning methods in applications to sequential targeted marketing. In particular, we propose and evaluate a progression of reinforcement learning methods, ranging from the "direct" or "batch" methods to "indirect" or "simulation based" methods, and those that we call "semi-direct" methods that fall between them. We conduct a num-ber of controlled experiments to evaluate the performance of these competing methods. Our results indicate that while the indirect methods can perform better in a situation in which nearly perfect modeling is possible, under the more realistic situations in which the system's modeling parameters have restricted attention, the indirect methods' performance tend to degrade. We also show that semi-direct methods are effective in reducing the amount of computation necessary to attain a given level of performance, and often result in more profitable policies.
Citation:
Naoki Abe, Edwin Pednault, Haixun Wang, Bianca Zadrozny, Wei Fan, Chid Apte, "Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing," icdm, pp.3, Second IEEE International Conference on Data Mining (ICDM'02), 2002
Usage of this product signifies your acceptance of the Terms of Use.
