Behnam Neyshabur

I am a senior staff research scientist at Google. Before that, I was a postdoctoral researcher at New York University and a member of Theoretical Machine Learning program at Institute for Advanced Study (IAS) in Princeton. In summer 2017, I received a PhD in computer science at TTI-Chicago where I was fortunate to be advised by Nati Srebro. My current primary interest is reasoning and algorithmic capabilities of large language models but I have also not lost my interest in science of deep learning and (out-of-distribution) generalization.

Research Areas

Authored Publications

Google Publications

Other Publications

Revisiting Neural Scaling Laws in Language and Vision

Ibrahim Mansour I Alabdulmohsin

Behnam Neyshabur

Xiaohua Zhai

NeurIPS (2022)

Long Range Language Modeling via Gated State Spaces

Harsh Mehta

Ankit Gupta

Ashok Cutkosky

Behnam Neyshabur

Arxiv (2022)

The role of permutation invariance in linear mode connectivity of neural networks

Rahim Entezari

Hanie Sedghi

Olga Saukh

Behnam Neyshabur

ICLR (2022)

Block-Recurrent Transformers

Delesley Stuart Hutchins

Imanol Schlag

Yuhuai Wu

Ethan S Dyer

Behnam Neyshabur

NeurIPS (2022)

Exploring Length Generalization in Large Language Models

Cem Anil

Yuhuai Wu

Anders Andreassen

Aitor Lewkowycz

Vedant Misra

Vinay Venkatesh Ramasesh

Ambrose Slone

Guy Gur-Ari

Ethan S Dyer

Behnam Neyshabur

NeurIPS Oral (2022)

Leveraging Unlabeled Data to Predict Out-of-Distribution Performance

Saurabh Garg

Sivaraman Balakrishnan

Zachary Chase Lipton

Behnam Neyshabur

Hanie Sedghi

ICLR (2022)

Exploring the Limits of Large Scale Pre-training

Samira Abnar

Mostafa Dehghani

Behnam Neyshabur

Hanie Sedghi

ICLR Spotlight (2022)

A Loss Curvature Perspective On Training Instability in Deep Learning

Justin Gilmer

Behrooz Ghorbani

Ankush Garg

Sneha Reddy Kudugunta

Behnam Neyshabur

David Cardoze

George Edward Dahl

Zachary Nado

Orhan Firat

ICLR (2022)

Solving Quantitative Reasoning Problems with Language Models

Aitor Lewkowycz

Anders Andreassen

David Martin Dohan

Ethan S Dyer

Henryk Michalewski

Vinay Ramasesh

Ambrose Slone

Cem Anil

Imanol Schlag

Theo Gutman-Solo

Yuhuai Wu

Behnam Neyshabur

Guy Gur-Ari

Vedant Misra

NeurIPS (2022)

When Do Curricula Work?

Xiaoxia Wu

Ethan Dyer

Behnam Neyshabur

ICLR Oral (2021)

Deep Learning Through the Lens of Example Difficulty

Robert John Nicholas Baldock

Hartmut Maennel

Behnam Neyshabur

NeurIPS (2021)

Are wider nets better given the same number of parameters?

Anna Golubeva

Behnam Neyshabur

Guy Gur-Ari

ICLR (2021)

Extreme Memorization via Scale of Initialization

Harsh Meta

Ashok Cutkosky

Behnam Neyshabur

ICLR (2021)

Sharpness-aware Minimization for Efficiently Improving Generalization

Pierre Foret

Ariel Kleiner

Hossein Mobahi

Behnam Neyshabur

ICLR Spotlight (2021)

Methods and Analysis of The First Competition in Predicting Generalization of Deep Learning

Yiding Jiang

Parth Natekar

Manik Sharma

Sumukh K. Aithal

Dhruva Kashyap

Natarajan Subramanyam

Carlos Lassance

Daniel M. Roy

Gintare Karolina Dziugaite

Suriya Gunasekar

Isabelle Guyon

Pierre Foret

Scott Yak i

Hossein Mobahi

Behnam Neyshabur

Samy Bengio

Proceedings of the NeurIPS 2020 Competition and Demonstration Track, PMLR (2021)

The Deep Bootstrap: Good Online Learners are Good Offline Generalizers

Preetum Nakkiran

Behnam Neyshabur

Hanie Sedghi

ICLR (2021)

Understanding the failure modes of out-of-distribution generalization

Vaishnavh Nagarajan

Anders Andreassen

Behnam Neyshabur

ICLR (2021)

Avoiding Spurious Correlations: Bridging Theory and Practice

Thao Nguyen

Vaishnavh Nagarajan

Hanie Sedghi

Behnam Neyshabur

NeurIPS 2021 Workshop on Distribution Shifts: Connecting Methods and Applications

The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning

Anders Andreassen

Yasaman Bahri

Behnam Neyshabur

Rebecca Roelofs

arXiv (2021)

What is being transferred in transfer learning?

Behnam Neyshabur

Hanie Sedghi

Chiyuan Zhang

NeurIPS (2020)

Towards Learning Convolutions from Scratch

Behnam Neyshabur

NeurIPS (2020)

NeurIPS 2020 Competition: Predicting Generalization in Deep Learning

Yiding Jiang

Pierre Foret

Scott Yak

Daniel M. Roy

Hossein Mobahi

Gintare Karolina Dziugaite

Samy Bengio

Suriya Gunasekar

Isabelle Guyon

Behnam Neyshabur

arXiv (2020)

The intriguing role of module criticality in the generalization of deep networks

Niladri Chatterji

Behnam Neyshabur

Hanie Sedghi

ICLR Spotlight (2020)

Observational Overfitting in Reinforcement Learning

Behnam Neyshabur

Stephen Tu

Xingyou Song

Yiding Jiang

Yilun Du

ICLR (2020)

Fantastic Generalization Measures and Where to Find Them

Yiding Jiang

Behnam Neyshabur

Hossein Mobahi

Dilip Krishnan

Samy Bengio

ICLR (2020)

No Results Found

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Behnam Neyshabur

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Behnam Neyshabur

Research Areas

Filter by:

Year

Research Area

Join us

AI/ML Foundations  & Capabilities