Sequential Learning

Most of the course is adapted from the course given last year by Emilie Kaufmann (see her page).
For information about projects, see here

Bandit Algorithms. Tor Lattimore and Csaba Szepesvari (2019).
Reinforcement Learning. Richard Sutton and Andrew Barto (2018 edition).
Reinforcement Learning Algorithms. Csaba Szepesvari (2009).
Markov Decision Processes. Martin Puterman (1994).
Lecture notes of similar courses written by several colleagues: Emilie Kaufmann, Rémi Munos, Alessandro Lazaric and Aurélien Garivier.