Table of Contents
Bandits: A new beginning Finite-armed stochastic bandits: Warming up First steps: Explore-then-Commit The Upper Confidence Bound (UCB) Algorithm Optimality concepts and information theory More information theory and minimax lower bounds Instance dependent lower bounds Adversarial bandits High probability lower bounds Continue Reading