Learning theory

About the team

We are dedicated to advancing the theoretical foundations of machine learning (ML). Our team has extensive expertise in a variety of areas, including learning theory, statistical learning theory, optimization, decision making under uncertainty, reinforcement learning, and theory and algorithms in general. Our mission is twofold: to foster a principled understanding of ML techniques and to leverage this knowledge in designing highly effective algorithms. Ultimately, we aim to deploy these algorithms to achieve significant impact on Google, the wider academic community, and the scientific field of ML as a whole.

Team focus summaries

Optimization for machine learning

We work on optimization methods for machine learning, with application areas such as training large language models and federated learning.

Reinforcement learning

We design theoretically sound algorithms to solve real-world reinforcement learning problems, with applications including recommendation tasks, optimization of computer systems and fine-tuning of generative models.

Online learning, bandits, and active learning

Our research focuses on crafting algorithms and strategies for making sequential decisions in dynamic and uncertain environments based on partial information.

Learning dynamics in games

Multiplayer games provide a framework for understanding how humans and algorithms interact in complex systems. We aim to understand and carefully design these systems to balance efficiency and equity.

Privacy

We develop algorithms for training machine learning models with differential privacy, as well as with alternative privacy guarantees.

Generalization

We develop new learning algorithms with generalization guarantees for various learning scenarios.

Featured publications

On the Convergence of Adam and Beyond

Sashank Reddi

Satyen Kale

Sanjiv Kumar

International Conference on Learning Representations (2018)

Easy Learning from Label Proportions

Andres Munoz Medina

Claudio Gentile

Robert Busa-Fekete

Travis Dick

Heejin Choi

NeurIPS (2023)

Multiple-policy High-confidence Policy Evaluation

Christoph Dann

Mohammad Ghavamzadeh

Teodor Marinov

International Conference on Artificial Intelligence and Statistics (2023), pp. 9470-9487

Foundations of Machine Learning

Mehryar Mohri

Afshin Rostamizadeh

Ameet Talwalkar

The MIT Press (2018)

Layerwise Bregman Representation Learning of Neural Networks with Applications to Knowledge Distillation

Ehsan Amid

Rohan Anil

Christopher Fifty

Manfred Warmuth

Transactions on Machine Learning Research, 02/23 (2023)

Learning in POMDPs is Sample-Efficient with Hindsight Observability

Jonathan Lee

Alekh Agarwal

Christoph Dann

Tong Zhang

ICML (2023)

A Model Selection Approach for Corruption Robust Reinforcement Learning

Chen-Yu Wei

Christoph Dann

Julian Zimmert

33rd International Conference on Algorithmic Learning Theory (ALT 2022) (2022)

Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity

Alekh Agarwal

Tong Zhang

Neural Information Processing Systems (2023)

Efficient Training of Language Models using Few-Shot Learning

Sashank Reddi

Sobhan Miryoosefi

Stefani Karp

Shankar Krishnan

Satyen Kale

Seungyeon Kim

Sanjiv Kumar

ICML (2023)

Some of our locations

New York
