Skip to content
YouTube Twitter RSS
BAnDits Against non-Stationarity and Structure (BADASS)

BAnDits Against non-Stationarity and Structure (BADASS)

ANR JCJC - BADASS project
Skip to content
  • Home
  • Project
    • Presentation
    • People
    • Workpackages
    • BADASS Lecture Series
  • News and events
    • Hiring of a new Ph.D. student.
    • Kick-off meeting
  • Internships, visits and positions
  • Publications
Taking bandits to the next step Improved strategies for Bandits Non-stationary reward signals Decision making in Structured Environments Dissemination

Taking bandits to the next step

Multi-armed Bandits are a building block of many applications in Machine Learning. Despite its success, many real-world applications face problems dealing with non-stationary and structured data. The goal of this young researcher project is to advance the understanding of multi-armed Bandits both in the usual stationary setting and the context non-stationary and structured environments, which requires combining the bandit literature with research fields such as time series, universal prediction, spectral methods, concentration of measure, etc.

Improved strategies for Bandits

In the stochastic multi-armed bandits, strategies such as Thompson sampling or KL-UCB have been shown to be provably optimal for specific classes of distributions, such as exponential families of dimension 1 or discrete distributions. Much remains to be done to understand optimality for more general classes, and for the analysis of novel promising strategies.

Non-stationary reward signals

One important challenge is to handle non-stationary reward processes. Departing from standard work on individual sequences and worst case approach, it seems important to build better adaptive prediction processes for classes of controlled non-stationarity and revisit aggregation of experts in this context.

Decision making in Structured Environments

Taking into account the structure of the decision making problem is crucial in a number of applications. Structure can be formalized in many ways, using e.g. graphs, spectral methods, Markov decision processes or a combination of them. Structure also naturally appears when dealing with non-stationarity.

Dissemination

Wokshops, tutorials, invited researchers and dissemination in international conferences are important building blocks to popularize the next generation of bandits algorithms.

The BADASS project

The BADASS project has ended in 2021. The publications of the project can be found at this place

The project "Bandits Against Non-stationarity and Structure" (BADASS) is a JCJC project funded by the French ANR (ANR-16-CE40-0002).
It is headed by Odalric-Ambrym Maillard, together with Emilie Kaufmann and Richard Combes.
We intend to focus on the following objectives:
  1. To broaden the range of optimal strategies for stationary MABs: current strategies are only known to be provably optimal in a limited range of scenarios for which the class of distribution (structure) is perfectly known; also, recent heuristics possibly adaptive to the class need to be further analyzed.
  2. To strengthen the literature on pure sequential prediction (focusing on a single arm) for non-stationary signals via the construction of adaptive confidence sets and a novel measure of complexity: traditional approaches consider a worst-case scenario and are thus overly conservative and non-adaptive to simpler signals.
  3. To embed the low-rank matrix completion and spectral methods in the context of reinforcement learning, and further study models of structured environments: promising heuristics in the context of e.g. contextual MABs or Predictive State Representations require stronger theoretical guarantees.


Visiting researcher: Junpei Komiyama

omaillar 2018/11/16 2018/11/16Uncategorized

We are glad that Junpei Komiyama is visiting us from October 01 to November 15.

Continue reading

Pr. Tze Leung Lai at the EWRL

omaillar 2018/11/16 2018/11/16Uncategorized

The ANR BADASS funded the visit of Professor Tze Leung Lai as an invited speaker at the European Workshop on Reinforcement Learning, October 01-03, Lille. This is a historical event for the field of multi-armed bandit and reinforcement learning.

Continue reading

Postdoctorant member

omaillar 2018/11/16 2018/11/16Uncategorized

We are happy to announce that Mohammad Sadegh Talebi has joined the BADASS project in June 2018.  

Continue reading

Hiring of a new Ph.D. student.

omaillar 2017/06/30 2017/06/30Uncategorized

We finally complete the hiring process for the Ph.D. student of the BADASS project, after a tough selection procedure involving candidates from all over the world.

Continue reading

Kick-off meeting

admin 2016/11/08 2016/11/18Project Management 0

The kick-off of the ANR BADASS project will be held on Friday 27, January at Inria Lille – Nord Europe.

Continue reading

The "Bandits Against Non-stationarity and Structure" (BADASS) project is JCJC project funded by the French ANR.
Powered by Nirvana & WordPress. Mentions légales & CGU & Politique de confidentialité & Cookies
YouTube Twitter RSS

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

BAnDits Against non-Stationarity and Structure (BADASS)
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.