Publications

Publications

Publications HAL du projet ANR. Extra-learn

2020

Journal articles

ref_biblio: Tomáš Kocák, Rémi Munos, Branislav Kveton, Shipra Agrawal, Michal Valko. Spectral bandits. Journal of Machine Learning Research, 2020. ⟨hal-03084249⟩
Accès au texte intégral et bibtex

2019

Conference papers

ref_biblio: Peter Bartlett, Victor Gabillon, Michal Valko. A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption. Algorithmic Learning Theory, 2019, Chicago, United States. ⟨hal-01885368v2⟩
Accès au texte intégral et bibtex

ref_biblio: Xuedong Shang, Emilie Kaufmann, Michal Valko. General parallel optimization without a metric. Algorithmic Learning Theory, 2019, Chicago, United States. ⟨hal-02047225v2⟩
Accès au texte intégral et bibtex

Theses

ref_biblio: Ronan Fruit. Exploration-exploitation dilemma in Reinforcement Learning under various form of prior knowledge. Artificial Intelligence [cs.AI]. Université de Lille 1, Sciences et Technologies; CRIStAL UMR 9189, 2019. English. ⟨NNT : ⟩. ⟨tel-02388395v2⟩
Accès au texte intégral et bibtex

2018

Conference papers

ref_biblio: Ronan Fruit, Matteo Pirotta, Alessandro Lazaric. Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes. 32nd Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada. ⟨hal-01941220⟩
Accès au texte intégral et bibtex

ref_biblio: Xuedong Shang, Emilie Kaufmann, Michal Valko. Adaptive black-box optimization got easier: HCT only needs local smoothness. European Workshop on Reinforcement Learning, Oct 2018, Lille, France. ⟨hal-01874637⟩
Accès au texte intégral et bibtex

ref_biblio: Matteo Papini, Damiano Binaghi, Giuseppe Canonaco, Matteo Pirotta, Marcello Restelli. Stochastic Variance-Reduced Policy Gradient. ICML 2018 – 35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.4026-4035. ⟨hal-01940394⟩
Accès au texte intégral et bibtex

ref_biblio: Ronan Fruit, Matteo Pirotta, Alessandro Lazaric, Ronald Ortner. Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning. ICML 2018 – The 35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.1578-1586. ⟨hal-01941206⟩
Accès au texte intégral et bibtex

ref_biblio: Andrea Tirinzoni, Andrea Sessa, Matteo Pirotta, Marcello Restelli. Importance Weighted Transfer of Samples in Reinforcement Learning. ICML 2018 – The 35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.4936-4945. ⟨hal-01941213⟩
Accès au texte intégral et bibtex

ref_biblio: Yasin Abbasi-Yadkori, Peter Bartlett, Victor Gabillon, Alan Malek, Michal Valko. Best of both worlds: Stochastic & adversarial best-arm identification. Conference on Learning Theory, 2018, Stockholm, Sweden. ⟨hal-01808948v6⟩
Accès au texte intégral et bibtex

2017

Conference papers

ref_biblio: Ronan Fruit, Matteo Pirotta, Alessandro Lazaric, Emma Brunskill. Regret Minimization in MDPs with Options without Prior Knowledge. NIPS 2017 – Neural Information Processing Systems, Dec 2017, Long Beach, United States. pp.1-36. ⟨hal-01649082⟩
Accès au texte intégral et bibtex

ref_biblio: Carlos Riquelme, Mohammad Ghavamzadeh, Alessandro Lazaric. Active Learning for Accurate Estimation of Linear Models. ICML 2017 – 34th International Conference on Machine Learning, Aug 2017, Sydney, Australia. pp.36. ⟨hal-01538762⟩
Accès au texte intégral et bibtex

ref_biblio: Marc Abeille, Alessandro Lazaric. Linear Thompson Sampling Revisited. AISTATS 2017 – 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States. ⟨hal-01493561⟩
Accès au texte intégral et bibtex

ref_biblio: Ronan Fruit, Alessandro Lazaric. Exploration–Exploitation in MDPs with Options. AISTATS 2017 – 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States. ⟨hal-01493567v2⟩
Accès au texte intégral et bibtex

ref_biblio: Marc Abeille, Alessandro Lazaric. Thompson Sampling for Linear-Quadratic Control Problems. AISTATS 2017 – 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States. ⟨hal-01493564⟩
Accès au texte intégral et bibtex

ref_biblio: Daniele Calandriello, Alessandro Lazaric, Michal Valko. Efficient second-order online kernel learning with adaptive embedding. Neural Information Processing Systems, 2017, Long Beach, United States. ⟨hal-01643961⟩
Accès au texte intégral et bibtex

ref_biblio: Daniele Calandriello, Alessandro Lazaric, Michal Valko. Second-Order Kernel Online Convex Optimization with Adaptive Sketching. International Conference on Machine Learning, 2017, Sydney, Australia. ⟨hal-01537799⟩
Accès au texte intégral et bibtex

ref_biblio: Guillaume Gautier, Rémi Bardenet, Michal Valko. Zonotope hit-and-run for efficient sampling from projection DPPs. International Conference on Machine Learning, 2017, Sydney, Australia. ⟨hal-01526577v2⟩
Accès au texte intégral et bibtex

ref_biblio: Daniele Calandriello, Alessandro Lazaric, Michal Valko. Distributed adaptive sampling for kernel matrix approximation. International Conference on Artificial Intelligence and Statistics, 2017, Fort Lauderdale, United States. ⟨hal-01482760⟩
Accès au texte intégral et bibtex

ref_biblio: Akram Erraqabi, Alessandro Lazaric, Michal Valko, Emma Brunskill, Yun-En Liu. Trading off rewards and errors in multi-armed bandits. International Conference on Artificial Intelligence and Statistics, 2017, Fort Lauderdale, United States. ⟨hal-01482765⟩
Accès au texte intégral et bibtex

2016

Conference papers

ref_biblio: Jean-Bastien Grill, Michal Valko, Rémi Munos. Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning. Neural Information Processing Systems, Dec 2016, Barcelona, Spain. ⟨hal-01389107v3⟩
Accès au texte intégral et bibtex

ref_biblio: Daniele Calandriello, Alessandro Lazaric, Michal Valko. Analysis of Nyström method with sequential ridge leverage score sampling. Uncertainty in Artificial Intelligence, Jun 2016, New York City, United States. ⟨hal-01343674⟩
Accès au texte intégral et bibtex

ref_biblio: Tomáš Kocák, Gergely Neu, Michal Valko. Online learning with Erdős-Rényi side-observation graphs. Uncertainty in Artificial Intelligence, Jun 2016, New York City, United States. ⟨hal-01320588⟩
Accès au texte intégral et bibtex

ref_biblio: Akram Erraqabi, Michal Valko, Alexandra Carpentier, Odalric-Ambrym Maillard. Pliable rejection sampling. International Conference on Machine Learning, Jun 2016, New York City, United States. ⟨hal-01322168⟩
Accès au texte intégral et bibtex

ref_biblio: Kamyar Azizzadenesheli, Alessandro Lazaric, Animashree Anandkumar. Reinforcement Learning of POMDPs using Spectral Methods. Proceedings of the 29th Annual Conference on Learning Theory (COLT2016), Jun 2016, New York City, United States. ⟨hal-01322207⟩
Accès au texte intégral et bibtex

ref_biblio: Tomáš Kocák, Gergely Neu, Michal Valko. Online learning with noisy side observations. International Conference on Artificial Intelligence and Statistics, May 2016, Seville, Spain. ⟨hal-01303377⟩
Accès au texte intégral et bibtex

ref_biblio: Alexandra Carpentier, Michal Valko. Revealing graph bandits for maximizing local influence. International Conference on Artificial Intelligence and Statistics, May 2016, Seville, Spain. ⟨hal-01304020v3⟩
Accès au texte intégral et bibtex

ref_biblio: Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh, Ronald Ortner, Peter Bartlett. Improved Learning Complexity in Combinatorial Pure Exploration Bandits. Proceedings of the 19th International Conference on Artificial Intelligence (AISTATS), May 2016, Cadiz, Spain. ⟨hal-01322198⟩
Accès au texte intégral et bibtex

2015

Conference papers

ref_biblio: Jessica Chemali, Alessandro Lazaric. Direct Policy Iteration with Demonstrations. IJCAI – 24th International Joint Conference on Artificial Intelligence, Jul 2015, Buenos Aires, Argentina. ⟨hal-01237659⟩
Accès au texte intégral et bibtex

ref_biblio: Julien Audiffren, Michal Valko, Alessandro Lazaric, Mohammad Ghavamzadeh. Maximum Entropy Semi-Supervised Inverse Reinforcement Learning. International Joint Conference on Artificial Intelligence, Jul 2015, Bueons Aires, Argentina. ⟨hal-01146187⟩
Accès au texte intégral et bibtex

ref_biblio: Daniele Calandriello, Alessandro Lazaric, Michal Valko. Large-scale semi-supervised learning with online spectral graph sparsification. Resource-Efficient Machine Learning workshop at International Conference on Machine Learning, Jul 2015, Lille, France. ⟨hal-01544929⟩
Accès au texte intégral et bibtex

ref_biblio: Alexandra Carpentier, Michal Valko. Simple regret for infinitely many armed bandits. International Conference on Machine Learning, Jul 2015, Lille, France. ⟨hal-01153538⟩
Accès au texte intégral et bibtex

ref_biblio: Manjesh Kumar Hanawal Hanawal, Venkatesh Saligrama, Michal Valko, Rémi Munos. Cheap Bandits. International Conference on Machine Learning, 2015, Lille, France. ⟨hal-01153540⟩
Accès au texte intégral et bibtex

ref_biblio: Jean-Bastien Grill, Michal Valko, Rémi Munos. Black-box optimization of noisy functions with unknown smoothness. Neural Information Processing Systems, 2015, Montréal, Canada. ⟨hal-01222915v4⟩
Accès au texte intégral et bibtex

2014

Conference papers

ref_biblio: Daniele Calandriello, Alessandro Lazaric, Marcello Restelli. Sparse Multi-task Reinforcement Learning. NIPS – Advances in Neural Information Processing Systems 26, Dec 2014, Montreal, Canada. ⟨hal-01073513⟩
Accès au texte intégral et bibtex

ref_biblio: Julien Audiffren, Michal Valko, Alessandro Lazaric, Mohammad Ghavamzadeh. MESSI: Maximum Entropy Semi-Supervised Inverse Reinforcement Learning. NIPS Workshop on Novel Trends and Applications in Reinforcement Learning, 2014, Montreal, Canada. ⟨hal-01177446⟩
Accès au texte intégral et bibtex

Comments are closed.