Multi-armed Bandit Algorithms and Empirical Evaluation

Joannès Vermorel
ECML (2005), pp. 437-448

Abstract