Reinforcement Learning and Artificial Intelligence (RLAI)
Paper Bank
Edited by Alborz Geramifard
The
ambition
of this page is to provide a paper bank for RL related papers together with people's comments and references..
Least-Square TD Methods:
M. G. Lagoudakis and R. Parr. "Model-Free Least-Squares policy iteration", Machine Learning 4(2003) 1107-1149, 2003
Boyan, J. A. "Technical Update: Least-Squares Temporal Difference Learning."
Machine Learning
49:233-246, 2002.
Xin X, He H. and Hu D. "Efficient Reinforcement Learning Using Recursive Least-Squares Methods",
Journal of Artificial Intelligence Research, Vol.16,2002, pp:259-292
R. Schoknecht and A. Merke. Convergent combinations of reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, volume 15, 2003.
R. Schoknecht and A. Merke. TD(0) converges provably faster than the residual gradient algorithm. In ICML, pp 680–687, 2003.
Function Approximation:
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: safely approximating the value function. In Advances in Neural Information Processing Systems 6, San Mateo, CA, 1995. Morgan Kaufmann
Pattern recognition:
Selfridge, O. G. "
Pandemonium: A paradigm for learning." National Physical Laboratory 1:513-529, 1958
Feature Discovery and Feature Manipulation:
M. Ahmadi, M. E. Taylor, and P. Stone. IFSA: Incremental Feature-Set Augmentation for Reinforcement Learning Tasks. In The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2007.
Behavior Imitation:
Price, B. and Boutilier, C. (2003) "Accelerating Reinforcement Learning through Implicit Imitation", Volume 19, pages 569-629
Policy Search:
A. Ng and M. Jordan. PEGASUS: A policy search method for large MDPs and POMDPs. In UAI, 2000
Modeling:
Sutton, R. S. (1995). TD models: Modeling the world at a mixture of time scales. In Prieditis, A. and Russell, S., editors, Machine Learning: Proceedings of the Twelfth International Conference, pages 531--539. Morgan Kaufmann Publishers, San Francisco, CA
World Knowledge:
How Bodies Matter: Five Themes for Interaction Design.
, by Scott R. Klemmer, Bjoern Hartmann, and Leila Takayama
Application :
Smith M., Lee-Urban S., and Munoz-Avila H., RETALIATE: Learning Winning Policies in First-Person Shooter Games,
Proceedings of the Seventeenth Innovative Applications of Artificial Intelligence Conference (IAAI-07). AAAI Press.