The project is organized into three main research work packages.
Work package 1: Transfer of exploration-exploitation strategies
Task T1.1: Knowledge transfer in multi-armed bandits
Task T1.2: Towards exploration-exploitation transfer in RL
Work package 2: Transfer solutions for approximated reinforcement learning
Task T2.1: Sample complexity reduction
Task T2.2: Automatic feature generation
Work package 3: Hierarchical transfer reinforcement learning
Task T3.1: Automatic construction of skills
Task T3.2: Hierarchical approximation schemes
Task T3.3: Integration and testing in complex environments