Jeffrey Dean

I joined Google in mid-1999, and I'm currently Google's Chief Scientist, focusing on AI advances for Google DeepMind and Google Research. My areas of focus include machine learning and AI and applications of AI to problems that help billions of people in societally beneficial ways. I have a broad variety of interests, including machine learning, large-scale distributed systems, computer systems performance, compression techniques, information retrieval, application of machine learning to search and other related problems, microprocessor architecture, compiler optimizations, and the development of new products that organize information in new and interesting ways. My Google Scholar page has a complete list of research papers I have co-authored.

In 2011, I co-founded the Google Brain project/team, focused on making progress towards intelligent machines. Since then, my individual work has focused on research, systems and applications for AI and ML, as well as steering the direction of our broader AI/ML and computer science research community. For the past few years, I’ve had the great pleasure to write a blog post early each year summarizing many pieces of the public work done by amazing colleagues and researchers over the previous year in our research teams (despite the similar-sounding titles, these annual blog posts are each quite different!).

Jan 2023: Google Research, 2022 & beyond: Language, vision and generative models (part 1 of a 9-part series)
Jan 2022: Google Research: Themes from 2021 and Beyond
Jan 2021: Google Research: Looking Back at 2020, and Forward to 2021
Jan 2020: Google Research: Looking Back at 2019, and Forward to 2020 and Beyond
Jan 2019: Looking Back at Google’s Research Efforts in 2018
Jan 2018: The Google Brain Team — Looking Back on 2017 (Part 1 of 2) … Part 2
Jan 2017: The Google Brain Team — Looking Back on 2016

Some of the areas I’ve worked on in AI and ML (generally with many collaborators!) include:

Research leadership. Steering the research directions of the Google Brain team, Google Research, and now Google DeepMind (with many others!). See year-end blog post links above for more details about this, which includes advances in things like the Transformer architecture, machine learning systems (DistBelief, TensorFlow, Pathways), TPUs, the Inception model, word2vec, seq2seq models, neural machine translation, distillation, neural architecture search/AutoML, RankBrain, BERT, TensorFlow, JAX, Pathways, PaLM, PaLM 2, PaLI, PaLM-E, MedPalm, NeRF, quantum computing advances, ML for chip design, computational photography (e.g. Night Sight & Magic Eraser), flood forecasting, Responsible AI research areas like bias, fairness and interpretability, medical diagnostics, auction theory, open source software and datasets, accessibility, weather forecasting, ML for robotics, connectomics, genomics, and more, as well as research impact in products across nearly all of Google, including Search, Ads, YouTube, GMail, Workspace, Maps, News, Photos, Translate, Android, Cloud, Pixel, Waymo, and many more products.
Computer systems for ML. The design and implementation of three generations of systems for training and deploying of deep learning models: DistBelief, TensorFlow, and Pathways.

In DistBelief, we explored large-scale, highly distributed systems and asynchronous training algorithms to enable ML models to be trained on large amounts of data, even on the relatively slow, non-ML-optimized hardware of the time (we trained models with 2B non-embedding parameters at a time when the largest models reported in the literature were 10M to 50M parameters). The system was used for hundreds of projects within Google and had widespread use across many Google products. Some of the earliest research work we did using DistBelief was exploring unsupervised learning on video frames to see what sorts of representations would emerge, in Building high-level features using large scale unsupervised learning, a.k.a "the cat neuron paper". We also used DistBelief to develop word2vec, various speech recognition models, multimodal work like DeViSE, and early embedding models like RankBrain.
TensorFlow: I was one of the primary designers and implementors of the initial TensorFlow system. I made the case that we should open-source Tensorflow, and we released it as an open source project in 2015, hosted on GitHub. It is used by millions of researchers and developers all over the world for exploring and creating ML and AI systems on platforms ranging from tiny embedded systems, to phones, desktop computers, and ML supercomputers. For detailed papers on TensorFlow, see Tensorflow: Large-scale machine learning on heterogeneous distributed systems (white paper) and TensorFlow: A System for Large-Scale Machine Learning (OSDI 2016).
Pathways is designed to support large-scale, multimodal, sparse architectures that are capable of solving thousands or millions of tasks. I was one of the original designers and implementers, and a paper about the systems research aspects of Pathways appeared in MLSys 2022 as Pathways: Asynchronous Distributed Dataflow for ML. The underlying system software has been used for work like the PaLM language models (which underlie work like Med-PaLM, PaLM-E for robotics, PaLI, and other downstream uses).
Language modeling. I have worked on many different projects related to language modeling, starting with work in 2007 that trained 300 billion parameter language models on trillions of tokens of text (Large language models in machine translation), demonstrating significant improvements in translation quality.
I was a co-author on a pair of papers that introduced an approach of learning distributed representations of words that is now commonly called word2vec (Efficient estimation of word representations in vector space and Distributed representations of words and phrases and their compositionality).
I was one of many who helped to convert the Google Translate system over to using a neural machine translation system, with further significant gains to translation quality. See Google’s neural machine translation system: Bridging the gap between human and machine translation (2016) and Google’s multilingual neural machine translation system: Enabling zero-shot translation. Gideon Lewis-Kraus of The NY Times magazine wrote an in-depth feature about the rollout of the neural machine translation system in Google Translate in The Great AI Awakening.
Part of the infrastructure work on Pathways is designed to enable scaling training of larger models on larger and more diverse datasets. I worked on the PaLM language model work, and I am one of the co-leads of the Gemini effort, which is building next-generation multimodal models that can use tools and APIs to enable more capable models that can be used in a variety of Google products and application areas.
Distillation. I am one of the co-creators of a machine learning technique called distillation, a now-widely-used approach for transferring the knowledge from one neural network to another. It is often used to create smaller, much more efficient models for inference from larger, more unwieldy models, and it can also be used to transfer knowledge from one neural network architecture to a completely different architecture. See Distilling the Knowledge in a Neural Network.
Sparse models. I have been involved in a series of work on sparse model architectures for neural networks, including Outrageously large neural networks: The sparsely-gated mixture-of-experts layer (2017) and Designing Effective Sparse Expert Models. A review of approaches for sparse models appears in A Review of Sparse Expert Models in Deep Learning.
AI for ASIC chip design. I have worked on research on how to apply reinforcement learning to the problem of placement and routing in ASIC chip design. We have shown that it is possible to get performance that is as good or better than human performance on the problem of chip floorplanning in a system that runs in a few hours. Our work here was published in Nature and has been used for multiple generations of Google’s TPU ML accelerators.
ML for healthcare. I have worked on the use of AI and machine learning in healthcare settings. We have done work showing that machine learning on deidentified medical records can produce useful and actionable suggestions for clinicians, published as Scalable and Accurate Deep Learning with Electronic Health Records. The broader research community at Google has also done work on applying machine learning across many different problems in health, including medical imaging diagnostics, genomics, medical note transcription and summarization, and novel sensing (see health sections of year-in-review blog posts above). I’ve also collaborated on a couple of review articles in this space. One assessed some of the most promising directions for integrating deep learning into healthcare settings, and was published in Nature Medicine as A Guide to Deep Learning in Healthcare. The other was a NEJM article titled Machine Learning in Medicine.
ML for computer systems. I have worked with many others on advancing the use of machine learning for tackling computer systems problems. Among these are device placement using reinforcement learning to map abstract ML computation graphs onto a set of physical devices in order to give the best performance (and some follow-on work on a hierarchical version of this), and the use of learned index structures in database systems instead of traditional data structures like B-trees and hash tables.
Energy efficiency of machine learning. I have helped push forward Google’s TPU efforts, identifying fairly early in the widespread use of deep learning that creating efficient systems was going to require building customized accelerator hardware, leading to a long line of TPU processors. TPUv1 (In-datacenter Performance Analysis of a Tensor Processing Unit) targeted inference computations and was about 30X - 80X better performance/Watt than contemporary CPUs and GPUs. Subsequent TPU generations target both training and inference in large-scale ML accelerator systems and are crucial to much of the machine learning research and product applications of ML at Google. They are available to external entities as Google Cloud TPUs.
Carbon emissions of machine learning training is an area that is rife with misinformation due to the prevalence of flawed and inaccurate estimates, so I have also worked with others to correct some of this misinformation and put actual measured data into the literature. See Carbon emissions and large neural network training, especially appendices C and D, and The carbon footprint of machine learning training will plateau, then shrink (if ML researchers adopt best practices). I gave a talk on some of these issues at the 2022 MIT Climate Impacts of Computing and Communications workshop.

While at Google, I've also worked on the following:

Google Search. The design and implementation of five generations of our crawling, indexing, and query serving systems, covering two and three orders of magnitude growth in number of documents searched, number of queries handled per second, and frequency of updates to the system. We did not publish research papers on most aspects of this, but I gave a talk at WSDM'09 about some of the issues involved in building large-scale retrieval systems (slides).

Search ranking algorithms. Some aspects of our search ranking algorithms, notably improved handling for dealing with off-page signals such as anchortext.

Search ranking prototyping system. The design and implementation of prototyping infrastructure for rapid development and experimentation with new ranking algorithms.

MapReduce. The design and implementation of MapReduce, a system for simplifying the development of large-scale data processing applications. A paper about MapReduce appeared in OSDI'04. MapReduce is used extensively within Google, and provided the inspiration for external open-source projects like Hadoop, as well as follow-on projects like Flume.
BigTable. The design and implementation of BigTable, a large-scale semi-structured storage system used underneath a number of Google products. A paper about BigTable appeared in OSDI'06. BigTable is used by hundreds of teams at Google and sits underneath dozens of products. It is available externally as Cloud Bigtable.
Spanner. The design and implementation of Spanner, a geographically-distributed worldwide storage system that can provide strong consistency guarantees through the use of Paxos and highly synchronized clocks in multiple data centers. A paper about Spanner appeared in OSDI’12. Spanner is used extensively for hundreds of projects within Google, underlies a large fraction of our products, and is available for external uses as Google’s Cloud Spanner product.
Google Ads. I was part of a group of three people who did the design and implementation of the initial version of Google's advertising serving system.

AdSense. The initial development of Google's AdSense for Content product (involving both the production serving system design and implementation as well as work on developing and improving the quality of ad selection based on the contents of pages).

Protocol buffers. The development of Protocol Buffers, a way of encoding structured data in an efficient yet extensible format, and a compiler that generates convenient wrappers for manipulating the objects in a variety of languages. Protocol Buffers are used extensively at Google for almost all RPC protocols, and for storing structured information in a variety of persistent storage systems. A version of the protocol buffer implementation has been open-sourced and is available at https://github.com/protocolbuffers/protobuf/, and a developer site with documentation and more details is at https://protobuf.dev/.

Google News. Some of the initial production serving system work for the Google News product, working with Krishna Bharat to move the prototype system he put together into a deployed system.
Job scheduling system. The design and implementation of the first generation of our automated job scheduling system for managing a cluster of machines.

Timeseries analysis system. The initial design and implementation of a system for analyzing complex timeseries data. This system is used extensively by dozens of Google teams to support various use cases like suggested completions, recommendations, etc. The system is available for Cloud customers to analyze their own datasets via the Timeseries Insights API.
Google Translate. Some of the production system design for Google Translate, our statistical machine translation system. In particular, I designed and implemented a system for distributed high-speed access to very large language models (too large to fit in memory on a single machine), and then later helped with the transition to using neural machine translation models.

LevelDB. The design and implementation of LevelDB, a high performance key-value store that we released as an open-source project. It is used in a wide variety of projects including Google Chrome.
Code search. Some internal tools to make it easy to rapidly search our internal source code repository. Many of the ideas from this internal tool were incorporated into our Google Code Search product, including the ability to use regular expressions for searching large corpora of source code.

I enjoy developing software with great colleagues, and I've been fortunate to have worked with many wonderful and talented people on all of my work here at Google. To help ensure that Google continues to hire people with excellent technical skills, I've also been fairly involved in our engineering hiring process.

I received a Ph.D. in computer science from the University of Washington in 1996, working on compiler optimizations for object-oriented languages advised by Craig Chambers. I received a B.S. in computer science and economics (summa cum laude) from the University of Minnesota in 1990 (doing honors theses on parallel training of neural networks and the economic impact of HIV/AIDS).

From 1996 to 1999, I worked for Digital Equipment Corporation's Western Research Lab in Palo Alto, where I worked on low-overhead profiling tools, design of profiling hardware for out-of-order microprocessors, and web-based information retrieval. From 1990 to 1991, I worked for the World Health Organization's Global Programme on AIDS, developing software to do statistical modeling, forecasting, and analysis of the HIV pandemic. In high school and during the summers in college, I worked first at the Centers for Disease Control and later at the World Health Organization developing a series of versions of software called Epi Info (wikipedia) for analyzing epidemiological data (still one of my most cited works).

In 2009, I was elected to the National Academy of Engineering, and in 2016, I was elected as a member of the American Academy of Arts and Sciences. I was also named a Fellow of the Association for Computing Machinery (ACM) and a Fellow of the American Association for the Advancement of Sciences (AAAS). I am a recipient of the ACM Prize in Computing (2012, with my long-time colleague Sanjay Ghemawat), the IEEE John von Neumann medal, and the Mark Weiser Award.

James Somers of the New Yorker wrote a delightful article in 2018 about me and my long-time collaborator Sanjay Ghemawat and how we work together: The Friendship That Made Google Huge.

Selected slides/talks:

Note that talks with similar titles sometimes end up having different mixes of content.

MIT Climate Impacts of Computing and Communications workshop, April 2022: Sustainable Computation and Machine Learning Platforms at Google
58th Design Automation Conference keynote, January, 2022: The Potential of Machine Learning for Hardware Design
TED talk, 2021: AI isn't as smart as you think -- but it could be
Humans of AI discussion, 2021: S1E17: Jeff Dean with Devi Parikh on Humans of AI: Stories, Not Stats
Virtual talk at TU @ Berlin, April, 2021: Tackling Grand Challenge Engineering Problems with Deep Learning
Khipu 2019, November, 2019: Deep Learning to Solve Challenging Problems
UW Allen School Distinguished Lecture, October, 2019: Deep Learning to Solve Challenging Problems
Stanford Medicine Big Data | Precision Health conference keynote, 2019: AI in Healthcare
Berkeley EECS Colloquium, November, 2018: Deep Learning to Solve Challenging Problems
Heidelberg Laureate Forum, September, 2018: Deep Learning and the Grand Engineering Challenges
ETH Zurich Lecture, September, 2018: Deep Learning to Solve Challenging Problems
Heidelberg University talk, September, 2018: Deep Learning to Solve Challenging Problems
Heidelberg Laureate Forum interview of me by Dr. Tom Crawford, September, 2018: Interview: How I Got Started Programming
Deep Learning Indaba, September 2018: TensorFlow and Real Life Machine Learning (long: ~2 hrs)
SysML 2018 invited talk, February, 2018: Systems and Machine Learning Symbiosis
Talk at YC AI meeting, August, 2017: Building Intelligent Systems with Large Scale Deep Learning (slides)
AI Frontiers Conference, January 2017: Trends and Developments in Deep Learning Research
TEDxLA talk, December, 2016: How Will Artificial Intelligence Affect Your Life
ACM Tech Talks, July, 2016: Large Scale Deep Learning with TensorFlow for Building Intelligent Systems
First Person, Palo Alto Online, March, 2016: First Person: A Conversation with Jeff Dean
UW Distinguished Lecture series, February, 2015: Large-Scale Deep Learning For Building Intelligent Computer Systems
Recommendation Systems (RecSys) keynote, October, 2014: Large Scale Machine Learning for Predictive Tasks (and part 2: there were issues in the recorded live stream so it got split into two)
Berkeley AMPLab Cloud Seminar talk, March, 2012: Achieving Rapid Response Times in Large Online Services
Stanford Computer Science Department Distinguished Computer Scientist Lecture lecture, November, 2010: Building Software Systems at Google and Lessons Learned
Symposium on Cloud Computing (SOCC) keynote, June, 2010: Evolution and Future Directions of Large-scale Storage and Computation Systems at Google
Web Search and Data Mining Conference (WSDM) keynote, February, 2009: Challenges in Building Large-Scale Information Retrieval Systems
Google Faculty Summit talk, July, 2008: Some Potential Areas for Future Research
Google I/O Developers Conference, May, 2008: Underneath the Covers at Google: Current Systems and Future Directions
Stanford CS295 class lecture, Spring, 2007: Software Engineering Advice from Building Large-Scale Distributed Systems
UW Colloquium, 2005: BigTable: A Distributed Structured Storage System
UW Colloquium, 2004: Google: A Behind the Scenes Look

Some of the papers I’ve co-authored with awesome colleagues have been fortunate enough to win various awards:

Outstanding Paper Award, MLSys 2022 (for Pathways: Asynchronous Distributed Dataflow for ML)
SIGOPS Hall of Fame Award, 2022 (for Spanner: Google’s Globally Distributed Database System at OSDI 2012)
Best Paper Award, EuroSys 2018 (for Dynamic Control Flow in Large-Scale Machine Learning)
SIGOPS Hall of Fame Award, 2016 (for Bigtable: A Distributed Storage System for Structured Data)
SIGOPS Hall of Fame Award, 2015 (for MapReduce: Simplified Data Processing on Large Clusters)
Best Paper Award, OSDI 2012 (for Spanner: Google’s Globally Distributed Database System)
10-year Retrospective Most Influential Paper Award from OOPSLA 2007 (for Call Graph Construction in Object-Oriented Languages, 1997).
Best Paper Award, OSDI 2006 (for Bigtable: A Distributed Storage System for Structured Data)
10-year Retrospective Most Influential Paper Award from PLDI 2005 (for Selective Specialization for Object-Oriented Languages, 1995)
Best Paper Award, SOSP 1997 (for Continuous Profiling: Where Have All the Cycles Gone?)

Personal:

I've lived in lots of places in my life: Honolulu, HI; Manila, The Phillipines; Boston, MA; West Nile District, Uganda; Boston (again); Little Rock, AR; Hawaii (again); Minneapolis, MN; Mogadishu, Somalia; Atlanta, GA; Minneapolis (again); Geneva, Switzerland; Seattle, WA; and (currently) Palo Alto, CA. I'm hard-pressed to pick a favorite, though: each place has its plusses and minuses.

One of my life goals is to play soccer and basketball on every continent. So far, I've done so in North America, South America, Europe, Asia, Oceania, and Africa. I'm worried that Antarctica might be tough, though.

Research Areas

Authored Publications

Google Publications

Other Publications

Emergent abilities of large language models

Barret Zoph

Colin Raffel

Dale Schuurmans

Dani Yogatama

Denny Zhou

Don Metzler

Ed H. Chi

Jason Wei

Jeff Dean

Liam B. Fedus

Maarten Paul Bosma

Oriol Vinyals

Percy Liang

Sebastian Borgeaud

Tatsunori B. Hashimoto

Yi Tay

TMLR (2022)

PaLM: Scaling Language Modeling with Pathways

Aakanksha Chowdhery

Sharan Narang

Jacob Devlin

Maarten Bosma

Gaurav Mishra

Adam Roberts

Paul Barham

Hyung Won Chung

Charles Sutton

Sebastian Gehrmann

Parker Schuh

Kensen Shi

Sasha Tsvyashchenko

Joshua Maynez

Abhishek Rao

Parker Barnes

Yi Tay

Noam Shazeer

Vinodkumar Prabhakaran

Emily Reif

Nan Du

Ben Hutchinson

Reiner Pope

James Bradbury

Jacob Austin

Michael Isard

Guy Gur-Ari

Pengcheng Yin

Toju Duke

Anselm Levskaya

Sanjay Ghemawat

Sunipa Dev

Henryk Michalewski

Xavier Garcia

Vedant Misra

Kevin Robinson

Liam Fedus

Denny Zhou

Daphne Ippolito

David Luan

Hyeontaek Lim

Barret Zoph

Alexander Spiridonov

Ryan Sepassi

David Dohan

Shivani Agrawal

Mark Omernick

Andrew M. Dai

Thanumalayan Sankaranarayana Pillai

Marie Pellat

Aitor Lewkowycz

Erica Moreira

Rewon Child

Oleksandr Polozov

Katherine Lee

Zongwei Zhou

Xuezhi Wang

Brennan Saeta

Mark Diaz

Orhan Firat

Michele Catasta

Jason Wei

Kathy Meier-Hellstern

Douglas Eck

Jeff Dean

Slav Petrov

Noah Fiedel

arxiv:2204.02311 (2022)

Pathways: Asynchronous Distributed Dataflow for ML

Paul Barham

Aakanksha Chowdhery

Jeff Dean

Sanjay Ghemawat

Steven Hand

Dan Hurt

Michael Isard

Hyeontaek Lim

Ruoming Pang

Sudip Roy

Brennan Saeta

Parker Edward Schuh

Ryan Sepassi

Laurent El Shafey

Chandu Thekkath

Yonghui Wu

MLSys 2022 (2022) (to appear)

The Carbon Footprint of Machine Learning Training Will Level Out and Then Reduce

Chen Liang

Dave Patterson

David Richard So

Jeff Dean

Lluis-Miquel Munguia

Maud Texier

Quoc V. Le

IEEE Computer (2022)

Deep learning-enabled medical computer vision

Andre Esteva

Kat Chou

Serena Yeung

Nikhil Naik

Ali Madani

Ali Mottaghi

Yun Liu

Eric Topol

Jeff Dean

Richard Socher

npj Digital Medicine (2021)

Customization Scenarios for De-identification of Clinical Notes

Avinatan Hassidim

Danny Vainstein

Gavin Edward Bee

Genady Beryozkin

Greg Corrado

Idan Szpektor

Itay Laish

Jack Po

Jeff Dean

Jutta Williams

Kat Chou

Michael Howell

Oren Gilon

Ronit Yael Slyper

Rony Amira

Scott Tyler Ellis

Shlomo Hoory

Tzvika Hartman

Yossi Matias

BMC Medical Informatics and Decision Making (2020)

An Augmented Reality Microscope with Real-time Artificial Intelligence Integration for Cancer Diagnosis

Cameron Chen

Krishna Kumar Gadepalli

Bob MacDonald

Yun Liu

Shiro Kadowaki

Kunal Nagpal

Timo Kohlberger

Jeff Dean

Greg Corrado

Jason Hipp

Craig Mermel

Martin Stumpe

Nature Medicine (2019)

Machine Learning for Medicine

Alvin Rishi Rajkomar

Isaac Kohane

Jeff Dean

New England Journal of Medicine (2019)

Preview

Automatically Charting Symptoms From Patient-Physician Conversations Using Machine Learning

Alvin Rishi Rajkomar

Anjuli Kannan

Claire Cui

Jeff Dean

Kai Chen

Kat Chou

Laura Vardoulakis

Journal of the American Medical Association (2019)

Dynamic Control Flow in Large-Scale Machine Learning

Yuan Yu

Martin Abadi

Paul Barham

Eugene Brevdo

Mike Burrows

Andy Davis

Jeff Dean

Sanjay Ghemawat

Tim Harley

Peter Hawkins

Michael Isard

Manjunath Kudlur

Rajat Monga

Derek Murray

Xiaoqiang Zheng

Proceedings of EuroSys 2018

Scalable and accurate deep learning for electronic health records

Alvin Rishi Rajkomar

Eyal Oren

Kai Chen

Andrew Dai

Nissan Hajaj

Mila Hardt

Peter J. Liu

Xiaobing Liu

Jake Marcus

Mimi Sun

Patrik Per Sundberg

Hector Yee

Kun Zhang

Yi Zhang

Gerardo Flores

Gavin Duggan

Jamie Irvine

Quoc Le

Kurt Litsch

Alex Mossin

Justin Jesada Tansuwan

De Wang

James Wexler

Jimbo Wilson

Dana Ludwig

Samuel Volchenboum

Kat Chou

Michael Pearson

Srinivasan Madabushi

Nigam Shah

Atul Butte

Michael Howell

Claire Cui

Greg Corrado

Jeff Dean

npj Digital Medicine (2018)

The Case for Learned Index Structures

Tim Kraska

Alex Beutel

Ed H. Chi

Jeff Dean

Neoklis Polyzotis

SIGMOD (2018)

Hierarchical Planning for Device Placement

Azalia Mirhoseini

Anna Goldie

Hieu Pham

Benoit Steiner

Quoc V. Le

Jeff Dean

ICLR (2018)

Efficient Neural Architecture Search via Parameters Sharing

Hieu Pham

Melody Guan

Barret Zoph

Quoc V. Le

Jeff Dean

ICML (2018)

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Noam Shazeer

Azalia Mirhoseini

Krzysztof Maziarz

Andy Davis

Quoc Le

Geoffrey Hinton

Jeff Dean

ICLR (2017)

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi

Cliff Young

Nishant Patil

David Patterson

Gaurav Agrawal

Raminder Bajwa

Sarah Bates

Suresh Bhatia

Nan Boden

Al Borchers

Rick Boyle

Pierre-luc Cantin

Clifford Chao

Chris Clark

Jeremy Coriell

Mike Daley

Matt Dau

Jeffrey Dean

Ben Gelb

Tara Vazir Ghaemmaghami

Rajendra Gottipati

William Gulland

Robert Hagmann

C. Richard Ho

Doug Hogberg

John Hu

Robert Hundt

Dan Hurt

Julian Ibarz

Aaron Jaffey

Alek Jaworski

Alexander Kaplan

Harshit Khaitan

Andy Koch

Naveen Kumar

Steve Lacy

James Laudon

James Law

Diemthu Le

Chris Leary

Zhuyuan Liu

Kyle Lucke

Alan Lundin

Gordon MacKean

Adriana Maggiore

Maire Mahony

Kieran Miller

Rahul Nagarajan

Ravi Narayanaswami

Ray Ni

Kathy Nix

Thomas Norrie

Mark Omernick

Narayana Penukonda

Andy Phelps

Jonathan Ross

ISCA (2017) (to appear)

Device Placement Optimization with Reinforcement Learning

Azalia Mirhoseini

Hieu Pham

Quoc Le

Mohammad Norouzi

Samy Bengio

Benoit Steiner

Yuefeng Zhou

Naveen Kumar

Rasmus Larsen

Jeff Dean

ICML (2017)

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

Yonghui Wu

Mike Schuster

Zhifeng Chen

Quoc V. Le

Mohammad Norouzi

Wolfgang Macherey

Maxim Krikun

Yuan Cao

Qin Gao

Klaus Macherey

Jeff Klingner

Apurva Shah

Melvin Johnson

Xiaobing Liu

Łukasz Kaiser

Stephan Gouws

Yoshikiyo Kato

Taku Kudo

Hideto Kazawa

Keith Stevens

George Kurian

Nishant Patil

Wei Wang

Cliff Young

Jason Smith

Jason Riesa

Alex Rudnick

Oriol Vinyals

Greg Corrado

Macduff Hughes

Jeffrey Dean

CoRR, vol. abs/1609.08144 (2016)

TensorFlow: A system for large-scale machine learning

Martin Abadi

Paul Barham

Jianmin Chen

Zhifeng Chen

Andy Davis

Jeffrey Dean

Matthieu Devin

Sanjay Ghemawat

Geoffrey Irving

Michael Isard

Manjunath Kudlur

Josh Levenberg

Rajat Monga

Sherry Moore

Derek G. Murray

Benoit Steiner

Paul Tucker

Vijay Vasudevan

Pete Warden

Martin Wicke

Yuan Yu

Xiaoqiang Zheng

12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), USENIX Association (2016), pp. 265-283

Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

Melvin Johnson

Mike Schuster

Quoc V. Le

Maxim Krikun

Yonghui Wu

Zhifeng Chen

Nikhil Thorat

Fernanda Viégas

Martin Wattenberg

Greg Corrado

Macduff Hughes

Jeffrey Dean

Google (2016)

The Beckman report on database research

Daniel Abadi

Rakesh Agrawal

Anastasia Ailamaki

Magdalena Balazinska

Philip A. Bernstein

Michael J. Carey

Surajit Chaudhuri

Jeffrey Dean

AnHai Doan

Michael J. Franklin

Johannes Gehrke

Laura M. Haas

Alon Y. Halevy

Joseph M. Hellerstein

Yannis E. Ioannidis

H. V. Jagadish

Donald Kossmann

Samuel Madden

Sharad Mehrotra

Tova Milo

Jeffrey F. Naughton

Raghu Ramakrishnan

Volker Markl

Christopher Olston

Beng Chin Ooi

Christopher Ré

Dan Suciu

Michael Stonebraker

Todd Walter

Jennifer Widom

Commun. ACM, vol. 59 (2016), pp. 92-99

Preview

Large-Scale Deep Learning For Building Intelligent Computer Systems

Jeffrey Dean

WSDM (2016), pp. 1

Preview

The rise of cloud computing systems

Jeffrey Dean

SOSP History Day (2015), 12:1-12:40

Preview

Distilling the Knowledge in a Neural Network

Geoffrey Hinton

Oriol Vinyals

Jeffrey Dean

NIPS Deep Learning and Representation Learning Workshop (2015)

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Martín Abadi

Ashish Agarwal

Paul Barham

Eugene Brevdo

Zhifeng Chen

Craig Citro

Greg Corrado

Andy Davis

Jeffrey Dean

Matthieu Devin

Sanjay Ghemawat

Ian Goodfellow

Andrew Harp

Geoffrey Irving

Michael Isard

Yangqing Jia

Rafal Jozefowicz

Lukasz Kaiser

Manjunath Kudlur

Josh Levenberg

Dan Mané

Rajat Monga

Sherry Moore

Derek Murray

Chris Olah

Mike Schuster

Jonathon Shlens

Benoit Steiner

Ilya Sutskever

Kunal Talwar

Paul Tucker

Vincent Vanhoucke

Vijay Vasudevan

Fernanda Viégas

Oriol Vinyals

Pete Warden

Martin Wattenberg

Martin Wicke

Yuan Yu

Xiaoqiang Zheng

tensorflow.org (2015)

The Beckman Report on Database Research

Daniel J. Abadi

Rakesh Agrawal

Anastasia Ailamaki

Magdalena Balazinska

Philip A. Bernstein

Michael J. Carey

Surajit Chaudhuri

Jeffrey Dean

AnHai Doan

Michael J. Franklin

Johannes Gehrke

Laura M. Haas

Alon Y. Halevy

Joseph M. Hellerstein

Yannis E. Ioannidis

H. V. Jagadish

Donald Kossmann

Samuel Madden

Sharad Mehrotra

Tova Milo

Jeffrey F. Naughton

Raghu Ramakrishnan

Volker Markl

Christopher Olston

Beng Chin Ooi

Christopher Ré

Dan Suciu

Michael Stonebraker

Todd Walter

Jennifer Widom

SIGMOD Record, vol. 43 (2014), pp. 61-70

Preview

Large Scale Deep Learning

Jeffrey Dean

Tsinghua University (2014)

Zero-Shot Learning by Convex Combination of Semantic Embeddings

Mohammad Norouzi

Tomas Mikolov

Samy Bengio

Yoram Singer

Jonathon Shlens

Andrea Frome

Greg Corrado

Jeffrey Dean

International Conference on Learning Representations (2014)

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov

Ilya Sutskever

Kai Chen

Greg Corrado

Jeffrey Dean

Neural and Information Processing System (NIPS) (2013)

DeViSE: A Deep Visual-Semantic Embedding Model

Andrea Frome

Greg Corrado

Jonathon Shlens

Samy Bengio

Jeffrey Dean

Marc’Aurelio Ranzato

Tomas Mikolov

Neural Information Processing Systems (NIPS) (2013)

Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov

Kai Chen

Greg S. Corrado

Jeffrey Dean

International Conference on Learning Representations (2013)

Using Web Co-occurrence Statistics for Improving Image Categorization

Samy Bengio

Jeffrey Dean

Dumitru Erhan

Eugene Ie

Quoc Le

Andrew Rabinovich

Jonathon Shlens

Yoram Singer

arXiv (2013)

Multilingual acoustic models using distributed deep neural networks

Georg Heigold

Vincent Vanhoucke

Andrew Senior

Patrick Nguyen

Marc'aurelio Ranzato

Matthieu Devin

Jeff Dean

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Vancouver, CA (2013)

Spanner: Google's Globally Distributed Database

James C. Corbett

Jeffrey Dean

Michael Epstein

Andrew Fikes

Christopher Frost

J. J. Furman

Sanjay Ghemawat

Andrey Gubarev

Christopher Heiser

Peter Hochschild

Wilson C. Hsieh

Sebastian Kanthak

Eugene Kogan

Hongyi Li

Alexander Lloyd

Sergey Melnik

David Mwaura

David Nagle

Sean Quinlan

Rajesh Rao

Lindsay Rolig

Yasushi Saito

Michal Szymaniak

Christopher Taylor

Ruth Wang

Dale Woodford

ACM Trans. Comput. Syst., vol. 31 (2013), pp. 8

Preview

The Tail at Scale

Jeffrey Dean

Luiz André Barroso

Communications of the ACM, vol. 56 (2013), pp. 74-80

On Rectified Linear Units For Speech Processing

M.D. Zeiler

M. Ranzato

R. Monga

M. Mao

K. Yang

Q.V. Le

P. Nguyen

A. Senior

V. Vanhoucke

J. Dean

G.E. Hinton

38th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver (2013)

Large Scale Distributed Deep Networks

Jeffrey Dean

Greg S. Corrado

Rajat Monga

Kai Chen

Matthieu Devin

Quoc V. Le

Mark Z. Mao

Marc’Aurelio Ranzato

Andrew Senior

Paul Tucker

Ke Yang

Andrew Y. Ng

NIPS (2012)

Building high-level features using large scale unsupervised learning

Quoc Le

Marc'Aurelio Ranzato

Rajat Monga

Matthieu Devin

Kai Chen

Greg Corrado

Jeff Dean

Andrew Ng

International Conference in Machine Learning (2012)

Achieving Rapid Response Times in Large Online Services

Jeffrey Dean

Talk given at Berkeley AMPLab Cloud Seminar, March 26, 2012 (2012)

Spanner: Google's Globally-Distributed Database

James C. Corbett

Jeffrey Dean

Michael Epstein

Andrew Fikes

Christopher Frost

JJ Furman

Sanjay Ghemawat

Andrey Gubarev

Christopher Heiser

Peter Hochschild

Wilson Hsieh

Sebastian Kanthak

Eugene Kogan

Hongyi Li

Alexander Lloyd

Sergey Melnik

David Mwaura

David Nagle

Sean Quinlan

Rajesh Rao

Lindsay Rolig

Dale Woodford

Yasushi Saito

Christopher Taylor

Michal Szymaniak

Ruth Wang

OSDI (2012)

Evolution and Future Directions of Large-scale Storage and Computation Systems at Google

Jeffrey Dean

Keynote talk given at 1st Symposium on Cloud Computing (SOCC), ACM, pp. 1-1

Evolution and future directions of large-scale storage and computation systems at Google

Jeffrey Dean

SoCC '10: Proceedings of the 1st ACM symposium on Cloud computing, ACM, New York, NY, USA (2010), pp. 1-1

Preview

MapReduce: a flexible data processing tool

Jeffrey Dean

Sanjay Ghemawat

Commun. ACM, vol. 53 (2010), pp. 72-77

Preview

Back-off Language Model Compression

Boulos Harb

Ciprian Chelba

Jeffrey Dean

Sanjay Ghemawat

Proceedings of Interspeech 2009, International Speech Communication Association (ISCA), pp. 325-355

Challenges in building large-scale information retrieval systems: invited talk

Jeffrey Dean

WSDM '09: Proceedings of the Second ACM International Conference on Web Search and Data Mining, ACM, New York, NY, USA (2009), pp. 1-1

Distributed Programming with MapReduce

Jeffrey Dean

Sanjay Ghemawat

Beautiful Code, O'Reilly (2007), Chapter 23

Preview

Large Language Models in Machine Translation

Thorsten Brants

Ashok C. Popat

Peng Xu

Franz J. Och

Jeffrey Dean

Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 858-867

Preview

Bigtable: A Distributed Storage System for Structured Data

Fay Chang

Jeffrey Dean

Sanjay Ghemawat

Wilson C. Hsieh

Deborah A. Wallach

Mike Burrows

Tushar Chandra

Andrew Fikes

Robert E. Gruber

7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), {USENIX} (2006), pp. 205-218

Experiences with MapReduce, an abstraction for large-scale computation

Jeffrey Dean

Proc. 15th International Conference on Parallel Architectures and Compilation Techniques, ACM, Seattle, WA (2006), pp. 1

Preview

MapReduce: Simplified Data Processing on Large Clusters

Jeffrey Dean

Sanjay Ghemawat

OSDI'04: Sixth Symposium on Operating System Design and Implementation, San Francisco, CA (2004), pp. 137-150

Web Search for a Planet: The Google Cluster Architecture

Luiz Andre Barroso

Jeffrey Dean

Urs Hölzle

IEEE Micro, vol. 23 (2003), pp. 22-28

A Comparison of Techniques to Find Mirrored Hosts on the WWW

Krishna Bharat

Andrei Z. Broder

Jeffrey Dean

Monika Rauch Henzinger

JASIS, vol. 51 (2000), pp. 1114-1122

Preview

MapReduce and Other Building Blocks for Large-Scale Distributed Systems at Google

Jeffrey Dean

USENIX Annual Technical Conference (2007)

Bigtable: A Distributed Storage System for Structured Data (Awarded Best Paper!)

Fay Chang

Jeffrey Dean

Sanjay Ghemawat

Wilson C. Hsieh

Deborah A. Wallach

Michael Burrows

Tushar Chandra

Andrew Fikes

Robert Gruber

OSDI (2006), pp. 205-218

LPI Linux certification - in a nutshell: a desktop quick reference: pass the LPIC-1 and LPIC-2 exams, 2nd Edition

Steven Pritchard

Bruno Gomes Pessanha

Nicolai Langfeldt

James Stanger

Jeffrey Dean

O'Reilly (2006), I-XVIII, 1-961

LPI Linux certification in a nutshell - a desktop quick reference: covers exams 101 102 for LPI level 1

Jeffrey Dean

O'Reilly (2001), I-XVI, 1-551

The Swift Java Compiler: Design and Implementation

Daniel J. Scales

Keith H. Randall

Sanjay Ghemawat

Jeffrey Dean

HP Labs Technical Reports (2000), pp. 26

A Comparison of Techniques to Find Mirrored Hosts on the WWW

Krishna Bharat

Andrei Z. Broder

Jeffrey Dean

Monika Rauch Henzinger

IEEE Data Eng. Bull., vol. 23 (2000), pp. 21-26

Finding Related Pages in the World Wide Web

Jeffrey Dean

Monika Rauch Henzinger

Computer Networks, vol. 31 (1999), pp. 1467-1479

Control of Walking in the Stick Insect: From Behavior and Physiology to Modeling

Jeffrey Dean

Thomas Kindermann

Josef Schmitz

Michael Schumm

Holk Cruse

Auton. Robots, vol. 7 (1999), pp. 271-288

A Comparison of Techniques to Find Mirrored Hosts on the WWW

Krishna Bharat

Andrei Z. Broder

Jeffrey Dean

Monika Rauch Henzinger

WOWS (1999), pp. 2-12

Hardware Support for Out-of-Order Instruction Profiling on Alpha 21264a

J. Anderson

L. Berc

Jeffrey Dean

Sanjay Ghemawat

S. Leung

M. Litchenberg

M Vandevoorde

G. Verns

C. Waldspurger

W. Weihl

J. White

HOTCHIPS 99, IEEE (1999)

Transparent, Low-Overhead Profiling on Modern Processors

Jennifer Anderson

Lance Berc

George Chrysos

Jeffrey Dean

Sanjay Ghemawat

Jamey Hicks

Shun-tak Leung

mitch Lichtenberg

Mark Vendevoorde

Carl A. Waldspurger

William E. Weihl

Workshop on Profile and Feedback-Directed Compilation, Paris (1998)

ProfileMe: Hardware Support for Instruction-Level Profiling on Out-of-Order Processors

Jeffrey Dean

James E. Hicks

Carl A. Waldspurger

William E. Weihl

George Chrysos

Proc. 30th Annual Symposium on Microarchitecture (1997)

ProfileMe: hardware support for instruction-level profiling on out-of-order processors

Jeffrey Dean

James E. Hicks

Carl A. Waldspurger

William E. Weihl

George Chrysos

MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, IEEE Computer Society, Washington, DC, USA (1997), pp. 292-302

Call Graph Construction in Object-Oriented Languages

David Grove

Greg DeFouw

Jeffrey Dean

Craig Chambers

OOPSLA (1997), pp. 108-124

ProfileMe: Hardware Support for Instruction-Level Profiling on Out-of-Order Processors

Jeffrey Dean

James E. Hicks

Carl A. Waldspurger

William E. Weihl

George Z. Chrysos

MICRO (1997), pp. 292-302

Continuous Profiling: Where Have All the Cycles Gone?

Jennifer-Ann M. Anderson

Lance M. Berc

Jeffrey Dean

Sanjay Ghemawat

Monika Rauch Henzinger

Shun-Tak Leung

Richard L. Sites

Mark T. Vandevoorde

Carl A. Waldspurger

William E. Weihl

ACM Transactions on Computer Systems, vol. 15 (1997), pp. 357-390

Vortex: An Optimizing Compiler for Object-Oriented Languages

Jeffrey Dean

Greg DeFouw

David Grove

Vassily Litvinov

Craig Chambers

OOPSLA, San Jose, CA (1996), pp. 83-100

Whole-program optimization of object-oriented languages

Jeffrey Adgate Dean

Ph.D. Thesis, University of Washington (1996)

Expressive, Efficient Instance Variables

Jeffrey Dean

David Grove

Craig Chambers

Vassily Litvinov

University of Washington (1996)

Simplifying Neural Networks for Controlling Walking by Exploiting Physical Properties

Holk Cruse

Christian Bartling

Jeffrey Dean

Thomas Kindermann

Josef Schmitz

Michael Schumm

Hendrik Wagner

ICANN (1996), pp. 433-438

Optimization of Object-Oriented Programs Using Static Class Hierarchy Analysis

Jeffrey Dean

David Grove

Craig Chambers

ECOOP (1995), pp. 77-101

A Framework for Selective Recompilation in the Presence of Complex Intermodule Dependencies

Craig Chambers

Jeffrey Dean

David Grove

ICSE, Seattle, Washington (1995), pp. 221-230

Selective Specialization for Object-Oriented Languages

Jeffrey Dean

Craig Chambers

David Grove

PLDI, La Jolla, CA (1995), pp. 93-102

Profile-Guided Receiver Class Prediction

David Grove

Jeffrey Dean

Charles Garrett

Craig Chambers

OOPSLA, Austin, TX (1995), pp. 108-123

Towards Better Inlining Decisions Using Inlining Trials

Jeffrey Dean

Craig Chambers

Proceedings of the 1994 Conference on Lisp and Functional Programming (L&FP'94), Orlando, FL, pp. 273-282

Identifying Profitable Specialization in Object-Oriented Languages

Jeffrey Dean

Craig Chambers

David Grove

Workshop on Partial Evaluation & Semantics-based Program Manipulation, Orlando, FL (1994), pp. 85-96

Epi Info: A General-purpose Microcomputer Program for Public Health Information Systems

Andrew Dean

Jeffrey Dean

Anthony Burton

Richard Dicker

American Journal of Preventative Medicine, vol. 7 (1991), pp. 178-182

Software for Data Management and Analysis in Epidemiology

A. H. Burton

Jeffrey Dean

Andrew Dean

Journal of the World Health Forum, vol. 11, no. 1 (1990), pp. 75-77

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Jeffrey Dean

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Jeffrey Dean

Research Areas

Filter by:

Year

Team

Research Area

Join us

AI/ML Foundations  & Capabilities