Rémy Leluc’s internship will be on adaptive Monte Carlo methods, in the context of reinforcement learning, examined from a theoretical point of view (convergence and inequalities, theoretical bounds) and from a practical point of view (implementation of new methods and comparisons with state-of-the art methods). He is being supervised by François Portier, lecturer at Télécom Paris and Pascal Bianchi professor at Télécom Paris.
Keywords: reinforcement learning, examined from a theoretical point of view, convergence and inequalities, theoretical bounds, state-of-the art methods