LeanBandits
Rémy Degenne
A Lean package for bandit algorithms
1 Introduction
2 Stochastic multi-armed bandits
2.1 Bandit model and probability space
2.2 Regret and other bandit quantities
2.3 Alternative model
3 Concentration inequalities
4 Bandit algorithms
4.1 Explore-Then-Commit
4.2 UCB
5 Practical Algorithms
6 Sampling