Adaptive Federated Optimization

Sashank Reddi; Zachary Burr Charles; Manzil Zaheer; Zachary Garrett; Keith Rush; Jakub Konečný; Sanjiv Kumar; Brendan McMahan

Adaptive Federated Optimization

Sashank Reddi

Zachary Burr Charles

Manzil Zaheer

Zachary Garrett

Keith Rush

Jakub Konečný

Sanjiv Kumar

Brendan McMahan

(2021)

Download Google Scholar

Abstract

Federated learning is a distributed machine learning paradigm in which a large number of clients coordinate with a central server to learn a model without sharing their own training data. Due to the heterogeneity of the client datasets, standard federated optimization methods such as Federated Averaging (FedAvg) are often difficult to tune and exhibit unfavorable convergence behavior. In non-federated settings, adaptive optimization methods have had notable success in combating such issues. In this work, we propose federated versions of adaptive optimizers, including Adagrad, Yogi and Adam, and analyze their convergence in the presence of heterogeneous data for general nonconvex settings. Our results highlight the interplay between client heterogeneity and communication efficiency. We also perform extensive experiments on these methods and show that the use of adaptive optimizers can improve the performance of federated learning.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Adaptive Federated Optimization

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs