Visit for course dates & times.
Note: This is the 2021–2022 eCalendar. Update the year in your browser's URL bar for the most recent version of this page, or .
Overview
Computer Science (Sci) : Bandit algorithms, finite Markov decision processes, dynamic programming, Monte-Carlo Methods, temporal-difference learning, bootstrapping, planning, approximation methods, on versus off policy learning, policy gradient methods temporal abstraction and inverse reinforcement learning.
Terms: Winter 2022
Instructors: Precup, Doina (Winter)