Date
|
Presenter
|
Topic
|
Link
|
Room
|
|
[Organizer: Varun]
|
|
|
|
May 13
|
Rich Sutton
|
Linear Dyna: planning with an
approximate learned model of the world's dynamics
|
|
CSC - 333
|
May 14
|
Eric Wiewiora
|
Trends in Structured Prediction
|
|
CSC - 333
|
May 15
|
Csaba Szepesvari
|
Regret to the average vs. regret
to the best (Even-Dar et al., COLT-2007)
|
ppt,
pdf
paper
|
CSC - 333
|
|
[Organizer: Adam White]
|
|
|
|
May 20
|
Amir Massoud Farahmand
|
Regularized Fitted Q-Iteration
|
|
|
May 21
|
Masoud Shamari
|
Environment with Independent
Delayed-Sense Dynamics
|
|
|
May 22
|
Mike Bowling
|
Cancelled
|
|
|
|
[Organizer: Adam]
|
|
|
|
May 26
|
James
|
Autonomous Geocaching.
Thesis/AAMAS talk
|
|
|
May 27
|
Hamid
|
Trends in off-policy learning
with linear function approximation
|
|
|
May 28
|
David
|
|
|
|
May 29
|
Mike
|
|
|
|
|
[Organizer: Arash]
|
|
|
|
June 2
|
David
|
|
|
|
June 3
|
Elliot
|
|
|
|
June 4
|
Gabor
|
Wingate and Singh: Exponential
Family Predictive Representations of State (NIPS 2007)
|
|
|
June 5
|
Brad
|
Cancelled
|
|
|
|
[Organizer: Leah]
|
|
|
|
June 9
|
Brad
|
|
|
|
June 10
|
Marc
|
|
|
|
June 11
|
Yasin
|
Cancelled
|
|
|
June 12
|
Varun
|
Online linear regression and its
application to model-based RL (NIPS 2007)
|
pdf
|
|
|
[Organizer: Yasin]
|
|
|
|
June 16
|
Arash
|
|
|
|
June 17
|
Vlad
|
|
|
|
June 18
|
Yasin
|
Time is Money!
|
|
|
June 19
|
Martha
|
Strategy Evaluation in Extensive
Games with Importance Sampling (2008)
|
|
|
|
[Organizer: Hamid]
|
|
|
|
June 23
|
Barnabas
|
Bregman Divergences
|
|
|
June 24
|
Siamak
|
Three Kinds of Probabilistic
Induction: Universal Distributions and Convergence Theorems
|
pdf
|
|
June 25
|
Leah
|
|
|
|
June 26
|
Mohammad
|
|
|
|
|
[Organizer:]
|
|
|
|
June 30
|
CANCELLED
|
CANCELLED
|
|
|
July 1
|
|
CANCELLED |
|
|
July 2
|
|
CANCELLED |
|
|
July 3
|
|
CANCELLED |
|
|
|
|
|
|
|
July 7
|
CANCELLED |
CANCELLED |
|
|
July 8
|
|
CANCELLED |
|
|
July 9
|
|
CANCELLED |
|
|
July 10
|
|
CANCELLED |
|
|
|
[Organizer: Amir Massoud]
|
|
|
|
July 14
|
Yavar
|
CANCELLED
|
|
|
July 15
|
Eric Wiewiora
|
Doya, et al. Multiple Model
Based Reinforcement Learning. Neural Computation, 2002.
|
html
|
|
July 16
|
Anna Koop
|
|
|
|
July 17
|
Adam White
|
The many faces of optimism: a
unifying approach
|
|
|
|
[Organizer: Martha]
|
|
|
|
July 21
|
Hamid
|
Trends in off-policy TD
learning
with linear function approximation II |
|
|
July 22
|
Brian Tanner
|
RL-Competition 2008 Summary
Report (by request) and the RL RecordBook
|
|
|
July 23
|
Elliot
|
CANCELLED
|
|
|
July 24
|
Marc
|
|
|
|
|
[Organizer: Siamak] |
|
|
|
July 28
|
Yavar
|
|
|
|
July 29
|
Elliot
|
|
|
|
July 30
|
Gabor
|
PSR On-line sequential
bin packing -- András György, Gábor Lugosi, György Ottucsák (COLT
2008)
|
|
|
July 31
|
Brad
|
CANCELLED
|
|
|
|
[Organizer: Marc]
|
|
|
|
Aug 4
|
Civic Holiday
|
CANCELLED
|
|
|
Aug 5
|
Barnabas
|
|
|
|
Aug 6
|
Rich Sutton
|
The Critterbot Project
|
|
|
Aug 7
|
|
CANCELLED
|
|
|
|
[Organizer: Yavar]
|
|
|
|
Aug 11
|
Brad
|
|
|
|
Aug 12
|
Mike Sokolsky
|
The system architecture of the
Critterbot
|
|
|
Aug 13
|
Varun
|
CANCELLED
|
|
|
Aug 14
|
Amir Massoud
|
Compressive Sampling
|
|
|
|
[Organizer: Eric Wiewiora] |
|
|
|
Aug 18
|
Martha
|
"Adapting Bias by Gradient
Descent: An Incremental Version of Delta-Bar-Delta" (and further work on
meta-learning)
|
|
|
Aug 19
|
Siamak
|
|
|
|
Aug 20
|
Varun
|
Overview of active learning
|
slides
|
|
Aug 21
|
CANCELLED
|
|
|
|
|
[Organizer: Marc]
|
|
|
|
Aug 25
|
CANCELLED
|
|
|
|
Aug 26
|
Csaba Szepesvari
|
An analysis of linear models,
linear value-function approximation, and feature selection for
reinforcement learning
|
PDF
|
|
Aug 27
|
Tom
|
|
|
|
Aug 28
|
Yasin
|
|
|
|