Jump to Content

Defining the technology of today and tomorrow.

Philosophy

We strive to create an environment conducive to many different types of research across many different time scales and levels of risk.
Learn more about our Philosophy

Philosophy

People

Our researchers drive advancements in computer science through both fundamental and applied research.
Learn more about our People

People
Research areas

Explore all research areas

Explore all research areas

Foundational ML & Algorithms

Algorithms & Theory

Data Management

Data Mining & Modeling

Information Retrieval & the Web

Machine Intelligence

Machine Perception

Machine Translation

Natural Language Processing

Speech Processing

Algorithms & Theory

Data Management

Data Mining & Modeling

Information Retrieval & the Web

Machine Intelligence

Machine Perception

Machine Translation

Natural Language Processing

Speech Processing

Computing Systems & Quantum AI

Distributed Systems & Parallel Computing

Hardware & Architecture

Mobile Systems

Networking

Quantum Computing

Robotics

Security, Privacy, & Abuse Prevention

Software Engineering

Software Systems

Distributed Systems & Parallel Computing

Hardware & Architecture

Mobile Systems

Networking

Quantum Computing

Robotics

Security, Privacy, & Abuse Prevention

Software Engineering

Software Systems

Science, AI & Society

Climate & Sustainability

Economics & Electronic Commerce

Education Innovation

General Science

Health & Bioscience

Human-Computer Interaction and Visualization

Responsible AI

Climate & Sustainability

Economics & Electronic Commerce

Education Innovation

General Science

Health & Bioscience

Human-Computer Interaction and Visualization

Responsible AI
Projects

We regularly open-source projects with the broader research community and apply our developments to Google products.
Learn more about our Projects

Projects

Publications

Publishing our work allows us to share ideas and work collaboratively to advance the field of computer science.
Learn more about our Publications

Publications

Resources

We make products, tools, and datasets available to everyone with the goal of building a more collaborative ecosystem.
Learn more about our Resources

Resources
Shaping the future, together.
Collaborate with us

Student programs

Supporting the next generation of researchers through a wide range of programming.
Learn more about our Student programs

Student programs

Faculty programs

Participating in the academic research community through meaningful engagement with university faculty.
Learn more about our Faculty programs

Faculty programs

Conferences & events

Connecting with the broader research community through events is essential for creating progress in every aspect of our work.
Learn more about our Conferences & events

Conferences & events

Collaborate with us
Careers
Blog

Large-scale optimization

Our mission is to develop large-scale optimization techniques and use them to improve the efficiency and robustness of infrastructure at Google.

About the team

We apply techniques from large-scale combinatorial optimization, online algorithms, and control theory to make Google’s computing infrastructure do more with less. We combine online and offline optimizations to achieve goals such as reducing search query latency, increasing model inference throughput and prediction quality, minimizing resource contention, maximizing the efficacy of caches, and eliminating unnecessary work in distributed systems. Our research is used in critical infrastructure that supports Search, Ads, Gemini, YouTube, and Cloud products.

Team focus summaries

Load balancing

Large-scale linear programming

ML model structure optimization

Search infrastructure optimization

Distributed optimization based on core-sets

Large-scale set cover

Consistent hashing

Featured publications

Load is not what you should balance: Introducing Prequal

Bartek Wydrowski

Bobby Kleinberg

Steve Rumble

Aaron Archer

(2024)

Sequential Attention for Feature Selection

Taisuke Yasuda

MohammadHossein Bateni

Lin Chen

Matthew Fahrbach

Gang Fu

Vahab Mirrokni

Proceedings of the 11th International Conference on Learning Representations (2023)

Practical Large-Scale Linear Programming using Primal-Dual Hybrid Gradient

David Applegate

Mateo Díaz

Oliver Hinder

Haihao Lu

Miles Lubin

Brendan O'Donoghue

Warren Schudy

NeurIPS 2021

Edge-Weighted Online Bipartite Matching

Matthew Fahrbach

Morteza Zadimoghaddam

Runzhou Tao

Zhiyi Huang

Journal of the ACM, 69 (2022), 45:1-45:35

Cache-aware load balancing of data center applications

Aaron Archer

Kevin Aydin

MohammadHossein Bateni

Vahab Mirrokni

Aaron Schild

Ray Yang

Richard Zhuang

Proceedings of the VLDB Endowment, 12 (2019), pp. 709-723

Submodular Maximization with Nearly Optimal Approximation, Adaptivity and Query Complexity

Matthew Fahrbach

Vahab Mirrokni

Morteza Zadimoghaddam

Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms (2019), pp. 255-273

Consistent Hashing with Bounded Loads

Vahab Mirrokni

Mikkel Thorup

Morteza Zadimoghaddam

Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms (2018), pp. 587-604

Almost Optimal Streaming Algorithms for Coverage Problems

Mohammadhossein Bateni

Hossein Esfandiari

Vahab Mirrokni

29th ACM Symposium on Parallelism in Algorithms and Architectures (2017)

Randomized Composable Core-sets for Distributed Submodular Maximization

Vahab S. Mirrokni

Morteza Zadimoghaddam

STOC (2015), pp. 153-162

HyperAttention: Large-scale Attention in Linear Time

Amin Karbasi

Amir Zandieh

Insu Han

Rajesh Jayaram

Vahab Mirrokni

David Woodruff

HyperAttention: Long-context Attention in Near-Linear Time (2024) (to appear)

Highlighted work

Some of our locations

Some of our people