Paper Bank

	Reinforcement Learning and Artificial Intelligence (RLAI)
	Paper Bank

Edited by Alborz Geramifard

The ambition of this page is to provide a paper bank for RL related papers together with people's comments and references..

Least-Square TD Methods:

M. G. Lagoudakis and R. Parr. "Model-Free Least-Squares policy iteration", Machine Learning 4(2003) 1107-1149, 2003
Boyan, J. A. "Technical Update: Least-Squares Temporal Difference Learning." Machine Learning 49:233-246, 2002.
Xin X, He H. and Hu D. "Efficient Reinforcement Learning Using Recursive Least-Squares Methods", Journal of Artificial Intelligence Research, Vol.16,2002, pp:259-292
R. Schoknecht and A. Merke. Convergent combinations of reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, volume 15, 2003.
R. Schoknecht and A. Merke. TD(0) converges provably faster than the residual gradient algorithm. In ICML, pp 680–687, 2003.

Function Approximation:

J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: safely approximating the value function. In Advances in Neural Information Processing Systems 6, San Mateo, CA, 1995. Morgan Kaufmann

Pattern recognition:

Selfridge, O. G. "Pandemonium: A paradigm for learning." National Physical Laboratory 1:513-529, 1958

Feature Discovery and Feature Manipulation:

M. Ahmadi, M. E. Taylor, and P. Stone. IFSA: Incremental Feature-Set Augmentation for Reinforcement Learning Tasks. In The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2007.

Behavior Imitation:

Price, B. and Boutilier, C. (2003) "Accelerating Reinforcement Learning through Implicit Imitation", Volume 19, pages 569-629

Policy Search:

A. Ng and M. Jordan. PEGASUS: A policy search method for large MDPs and POMDPs. In UAI, 2000

Modeling:

Sutton, R. S. (1995). TD models: Modeling the world at a mixture of time scales. In Prieditis, A. and Russell, S., editors, Machine Learning: Proceedings of the Twelfth International Conference, pages 531--539. Morgan Kaufmann Publishers, San Francisco, CA

World Knowledge:

How Bodies Matter: Five Themes for Interaction Design., by Scott R. Klemmer, Bjoern Hartmann, and Leila Takayama

Application :

Smith M., Lee-Urban S., and Munoz-Avila H., RETALIATE: Learning Winning Policies in First-Person Shooter Games, Proceedings of the Seventeenth Innovative Applications of Artificial Intelligence Conference (IAAI-07). AAAI Press.

Extend this Page How to edit Style Subscribe Notify Suggest Help This open web page hosted at the University of Alberta. Terms of use 3449/0