Ethan S Dyer

Authored Publications

Effect of scale on catastrophic forgetting in neural networks

Ethan Dyer

Aitor Lewkowycz

Vinay Ramasesh

ICLR (2022) (to appear)

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aitor Lewkowycz

Ambrose Slone

Anders Andreassen

Daniel Freeman

Ethan S Dyer

Gaurav Mishra

Guy Gur-Ari

Jaehoon Lee

Jascha Sohl-Dickstein

Kristen Chiafullo

Liam B. Fedus

Noah Fiedel

Rosanne Liu

Vedant Misra

Vinay Venkatesh Ramasesh

TBD (2022)

Exploring Length Generalization in Large Language Models

Cem Anil

Yuhuai Wu

Anders Andreassen

Aitor Lewkowycz

Vedant Misra

Vinay Venkatesh Ramasesh

Ambrose Slone

Guy Gur-Ari

Ethan S Dyer

Behnam Neyshabur

NeurIPS Oral (2022)

Solving Quantitative Reasoning Problems with Language Models

Aitor Lewkowycz

Anders Andreassen

David Martin Dohan

Ethan S Dyer

Henryk Michalewski

Vinay Ramasesh

Ambrose Slone

Cem Anil

Imanol Schlag

Theo Gutman-Solo

Yuhuai Wu

Behnam Neyshabur

Guy Gur-Ari

Vedant Misra

NeurIPS (2022)

Block-Recurrent Transformers

Delesley Stuart Hutchins

Imanol Schlag

Yuhuai Wu

Ethan S Dyer

Behnam Neyshabur

NeurIPS (2022)

When Do Curricula Work?

Xiaoxia Wu

Ethan Dyer

Behnam Neyshabur

ICLR Oral (2021)

Tradeoffs in Data Augmentation: An Empirical Study

Ekin Dogus Cubuk

Ethan S Dyer

Rapha Gontijo Lopes

Sylvia Smullin

ICLR (2021)

Explaining Neural Scaling Laws

Ethan S Dyer

Jaehoon Lee

Jared D Kaplan

Utkarsh Sharma

Yasaman Bahri

arXiv (2021)

Whitening and second order optimization both destroy information about the dataset, and can make generalization impossible

Daniel Duckworth

Ethan S Dyer

Jascha Sohl-Dickstein

Neha Wadia

Sam S. Schoenholz

ICML Spotlight (2021) (to appear)

Asymptotics of Wide Convolutional Neural Networks

Anders Andreassen

Ethan Dyer

arXiv (2020)

Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics

Maithra Raghu

Vinay Ramasesh

Ethan Dyer

ICML, ICLR (2020)

The large learning rate phase of deep learning

Aitor Lewkowycz

Ethan S Dyer

Guy Gur-Ari

Jascha Sohl-Dickstein

Yasaman Bahri

arXiv (2020)

Asymptotics of Wide Networks from Feynman Diagrams

Guy Gur-Ari

Ethan Dyer

ICLR Spotlight (2019) (to appear)

Gradient Descent Happens in a Tiny Subspace

Guy Gur-Ari

Ethan Dyer

Daniel A. Roberts

arXiv (2018)

The Most Irrational Rational Theories

Nathan Benjamin

Ethan Dyer

A. Liam Fitzpatrick

Yuan Xin

JHEP, vol. 04 (2018), pp. 025
