| Topic | Slides | Homework | Rich's old slides |
| Introduction | Chapter 1 | All exercises from the book | |
| Evaluative feedback | Chapter 2 | Exercises 2.17, 2.20, 2.21, 2.23, 2.27 and programming exercise 2.28 from here | Chapter 2 |
| The RL problem (MDPs, value functions, optimality) | Chapter 3 | Homework 3. Due date changed to Sep. 25 | Chapter 3 |
| Dynamic programming + linear programming | Chapter 4 | Homework 4. Due date is Oct. 2 | Chapter 4 |
| Monte Carlo methods | Chapter 5 | Exercises 5.7 and 5.8 from here. Due: Oct. 4 | Chapter 5 |
| Temporal-Difference learning | Chapter 6 | Homework 6. Due date is Oct. 9 | Chapter 6 |
| Eligibility traces | Chapter 7 | Homework 7. Due date is Oct. 16 | Chapter 7 |
| Generalization and function approximation | Chapter 8 | Programming Exercise 8.8 of Homework 8. Due date is Nov. 1 | Chapter 8 |
| Planning and learning, Prioritized Sweeping, dimensions of RL | Chapter 9 | | Chapter 9 |
| Policy gradient and actor-critic (Reading Material) | Slides | Homework. Due: Nov. 20 | |
| Least squares methods | Chapter 13:) (NEW, Dec. 10) | Homework (NEW). Due: Nov. 29 | |
| Hierarchical RL | | | |
| | | | Source files (powerpoint) as a tar archive (Jan 06) |
- There are no requirements on what programming language to use.
- Send me your program by e-mail, with the subject line in the following format:
"CMPUT607"<space><SID><space><ENUM>
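As an illustration of the subject format above, here is a minimal Python sketch that assembles such a line. It assumes the quotes around CMPUT607 only mark literal text, that SID stands for your student ID and that ENUM stands for the exercise number; the page does not define either field, so treat those readings, the helper name `make_subject` and the sample values as hypothetical.

```python
def make_subject(sid: str, enum: str) -> str:
    """Build a subject line of the form: CMPUT607 <SID> <ENUM>.

    Assumption: SID is a student ID and ENUM an exercise number;
    neither is defined on the course page.
    """
    for name, value in (("SID", sid), ("ENUM", enum)):
        # Each field must be a single token, since spaces separate the fields.
        if len(value.split()) != 1:
            raise ValueError(f"{name} must be non-empty and contain no spaces")
    return f"CMPUT607 {sid.strip()} {enum.strip()}"


# Placeholder values, not real identifiers.
subject = make_subject("1234567", "2.28")
print(subject)
```

Rejecting fields that contain whitespace keeps the line splittable into exactly three parts, which is presumably what makes the fixed format useful for sorting submissions.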
Csaba