Behnam Neyshabur

I am a senior staff research scientist at Google. Before that, I was a postdoctoral researcher at New York University and a member of Theoretical Machine Learning program at Institute for Advanced Study (IAS) in Princeton. In summer 2017, I received a PhD in computer science at TTI-Chicago where I was fortunate to be advised by Nati Srebro. My current primary interest is reasoning and algorithmic capabilities of large language models but I have also not lost my interest in science of deep learning and (out-of-distribution) generalization.

Research Areas

Authored Publications

The role of permutation invariance in linear mode connectivity of neural networks

Rahim Entezari

Hanie Sedghi

Olga Saukh

Behnam Neyshabur

ICLR(2022)

Revisiting Neural Scaling Laws in Language and Vision

Ibrahim Mansour I Alabdulmohsin

Behnam Neyshabur

Xiaohua Zhai

NeurIPS(2022)

Solving Quantitative Reasoning Problems with Language Models

Aitor Lewkowycz

Anders Andreassen

David Martin Dohan

Ethan S Dyer

Henryk Michalewski

Vinay Ramasesh

Ambrose Slone

Cem Anil

Imanol Schlag

Theo Gutman-Solo

Yuhuai Wu

Behnam Neyshabur

Guy Gur-Ari

Vedant Misra

NeurIPS(2022)

Exploring Length Generalization in Large Language Models

Cem Anil

Yuhuai Wu

Anders Andreassen

Aitor Lewkowycz

Vedant Misra

Vinay Venkatesh Ramasesh

Ambrose Slone

Guy Gur-Ari

Ethan S Dyer

Behnam Neyshabur

NeurIPS Oral(2022)

Leveraging Unlabeled Data to Predict Out-of-Distribution Performance

Saurabh Garg

Sivaraman Balakrishnan

Zachary Chase Lipton

Behnam Neyshabur

Hanie Sedghi

ICLR(2022)

Exploring the Limits of Large Scale Pre-training

Samira Abnar

Mostafa Dehghani

Behnam Neyshabur

Hanie Sedghi

ICLR Spotlight(2022)

Block-Recurrent Transformers

Delesley Stuart Hutchins

Imanol Schlag

Yuhuai Wu

Ethan S Dyer

Behnam Neyshabur

NeurIPS(2022)

Long Range Language Modeling via Gated State Spaces

Harsh Mehta

Ankit Gupta

Ashok Cutkosky

Behnam Neyshabur

Arxiv(2022)

A Loss Curvature Perspective On Training Instability in Deep Learning

Justin Gilmer

Behrooz Ghorbani

Ankush Garg

Sneha Reddy Kudugunta

Behnam Neyshabur

David Cardoze

George Edward Dahl

Zachary Nado

Orhan Firat

ICLR(2022)

The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning

Anders Andreassen

Yasaman Bahri

Behnam Neyshabur

Rebecca Roelofs

arXiv(2021)

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Behnam Neyshabur

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Behnam Neyshabur

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us

AI/ML Foundations  & Capabilities