Bandit Algorithms

Bandits: A new beginning

Posted onSeptember 4, 201617 Comments

Dear Interested Reader, Together with Tor, we have worked a lot on bandit problems in the past and developed a true passion for them. At the pressure of some friends and students (and a potential publisher), and also just to Continue Reading

Posted onAugust 1, 2016March 28, 20193 Comments

Bandits: A new beginning Finite-armed stochastic bandits: Warming up First steps: Explore-then-Commit The Upper Confidence Bound (UCB) Algorithm Optimality concepts and information theory More information theory and minimax lower bounds Instance dependent lower bounds Adversarial bandits High probability lower bounds Continue Reading