An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits
Authors: Andrea Tirinzoni, Matteo Pirotta, Marcello Restelli, Alessandro Lazaric
Conference: NeurIPS 2020
Abstract: In the contextual linear bandit setting, algorithms built on the optimism principle fail to exploit the structure of the problem and have been shown to be asymptotically suboptimal. In this paper, we […]