- Max Schwarzer
- Johan Obando Ceron
- Aaron Courville
- Marc Bellemare
- Pablo Samuel Castro
- Rishabh Agarwal
ICML (2023)
We introduce a value-based RL agent, which we call BBF, that achieves super-human performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used for value estimation, as well as a number of other design choices that enable this scaling in a sample-efficient manner. We conduct extensive analyses of these design choices and provide insights for future work. We end with a discussion about moving the goalpost for sample-efficient RL research on the ALE.
We maintain a portfolio of research projects, providing individuals and teams the freedom to emphasize specific types of work