Reinforcement Learning Software and Stuff

Proposed Standard for RL Software
TD Model of Classical Conditioning in C++ and Lisp
Lisp code for Acrobot (a code fragment, FYI)
Pseudo-code for combining TD(lambda) and backpropagation as in Tesauro's (1992) backgammon player TD-Gammon. See also the following paper.
Technical report giving detailed equations for the combination of TD(lambda) and backpropagation. See also above pseudo-code.
Lisp code for a simple illustrative example of an early version of Dyna-AHC (also known as Dyna-PI). This is a pure commonlisp program without graphics.
The schedule of the workshop "Reinforcement Learning: What We Know, What we Need" held at the Univ. of Mass. as part of the 1993 Int. Machine Learning Conference. Serves as one view of the key topics in RL.

See here for reinforcement learning software directly related to the book: Reinforcement Learning: An Introduction, by Sutton and Barto.