Four papers on multi-armed bandits and reinforcement learning accepted at AI&Stats’17
We had four papers on multi-armed bandits and reinforcement learning accepted at AI&Stats’17. The most interesting result is the first regret bound for learning in MDPs with options. For the …