Reinforcement Learning Software and Stuff
Amir Hesami's matlab version of the cart-pole system
, a graphics package for Macintosh Common Lispe
for Macintosh Common Lisp
Proposed Standard for RL Software
TD Model of Classical Conditioning in C++
Lisp code for Acrobot
(a code fragment, FYI)
combining TD(lambda) and backpropagation
as in Tesauro's (1992) backgammon player TD-Gammon. See also the following paper.
Technical report giving
detailed equations for the combination of TD(lambda) and backpropagation
. See also above pseudo-code.
Lisp code for a simple
illustrative example of an early version of Dyna-AHC
(also known as Dyna-PI). This is a pure commonlisp program without graphics.
The schedule of the workshop "
Reinforcement Learning: What We Know, What we Need
" held at the Univ. of Mass. as part of the 1993 Int. Machine Learning Conference. Serves as one view of the key topics in RL.
for reinforcement learning software directly related to the book:
Reinforcement Learning: An Introduction,
by Sutton and Barto.