A Comparative Analysis of Expected and Distributional Reinforcement Learning

Clare Lyle; Marc G. Bellemare; Pablo Samuel Castro

A Comparative Analysis of Expected and Distributional Reinforcement Learning

Clare Lyle

Marc G. Bellemare

Pablo Samuel Castro

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (2019)

Google Scholar

Abstract

Since their introduction a year ago, distributional approaches to reinforcement learning (distributional RL) have produced strong results relative to the standard, expectation-based, approach (expected RL). However, aside from theoretical convergence guarantees, there have been few theoretical results investigating the reasons behind the improvements distributional RL provides. In this paper we begin the investigation into this fundamental question by analyzing the differences in the tabular, linear approximation, and non-linear approximation settings. We prove theoretically that in the tabular and linear approximation settings, distributional RL does not provide an advantage over expected RL, and can in fact hurt performance. We then continue with an empirical analysis comparing distributional and expected RL methods in control settings with non-linear approximators to tease apart where the improvements from distributional RL methods are coming from.

Research Areas

Machine intelligence

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

A Comparative Analysis of Expected and Distributional Reinforcement Learning

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs