November 2016 – Bandit Algorithms

Adversarial linear bandits and the curious case of the unit ball

Posted onNovember 25, 2016March 17, 2019Leave a comment

According to the main result of the previous post, given any finite action set $\cA$ with $K$ actions $a_1,\dots,a_K\in \R^d$, no matter how an adversary selects the loss vectors $y_1,\dots,y_n\in \R^d$, as long as the action losses $\ip{a_k,y_t}$ are in Continue Reading

Adversarial linear bandits

Posted onNovember 24, 2016March 17, 20191 Comment

In the next few posts we will consider adversarial linear bandits, which, up to a crude first approximation, can be thought of as the adversarial version of stochastic linear bandits. The discussion of the exact nature of the relationship between Continue Reading

Sparse linear bandits

Posted onNovember 21, 20161 Comment

In the last two posts we considered stochastic linear bandits, when the actions are vectors in the $d$-dimensional Euclidean space. According to our previous calculations, under the condition that the expected reward of all the actions are in a fixed Continue Reading

Ellipsoidal Confidence Sets for Least-Squares Estimators

Posted onNovember 13, 20168 Comments

Continuing the previous post, here we give a construction for confidence bounds based on ellipsoidal confidence sets. We also put things together and show bound on the regret of the UCB strategy that uses the constructed confidence bounds. Constructing the Continue Reading