Andrew M. Dai

Andrew is a software engineer at Google. Prior to that he completed a PhD in machine learning at the University of Edinburgh and a MA in computer science at the University of Cambridge.

Research Areas

Authored Publications

Google Publications

Other Publications

MaMMUT: A Simple Vision-Encoder Text-Decoder Architecture for MultiModal Tasks

Weicheng Kuo

AJ Piergiovanni

Dahun Kim

Xiyang Luo

Ben Caine

Wei Li

Abhijit Ogale

Luowei Zhou

Andrew Dai

Zhifeng Chen

Claire Cui

Anelia Angelova

Transactions on Machine Learning Research (2023)

Sparsely Activated Language Models are Efficient In-Context Learners

Adams Yu

Andrew Dai

Barret Richard Zoph

Claire Cui

Dmitry (Dima) Lepikhin

Emma Wang

Kathy Meier-Hellstern

Kellie Webster

Kevin Robinson

Kun Zhang

Liam B. Fedus

Lucas Dixon

Maarten Paul Bosma

Marie Pellat

Maxim Krikun

Nan Du

Orhan Firat

Quoc V. Le

Simon Tong

Tao Wang

Toju Duke

Yanping Huang

Yanqi Zhou

Yonghui Wu

Yuanzhong Xu

Zhifeng Chen

Zongwei Zhou

(2022)

Mind's Eye: Grounded Language Model Reasoning through Simulation

Ruibo Liu

Jason Wei

Shixiang Shane Gu

Soroush Vosoughi

Claire Cui

Denny Zhou

Andrew Dai

ICLR 2023 (2022)

PaLM: Scaling Language Modeling with Pathways

Aakanksha Chowdhery

Sharan Narang

Jacob Devlin

Maarten Bosma

Gaurav Mishra

Adam Roberts

Paul Barham

Hyung Won Chung

Charles Sutton

Sebastian Gehrmann

Parker Schuh

Kensen Shi

Sasha Tsvyashchenko

Joshua Maynez

Abhishek Rao

Parker Barnes

Yi Tay

Noam Shazeer

Vinodkumar Prabhakaran

Emily Reif

Nan Du

Ben Hutchinson

Reiner Pope

James Bradbury

Jacob Austin

Michael Isard

Guy Gur-Ari

Pengcheng Yin

Toju Duke

Anselm Levskaya

Sanjay Ghemawat

Sunipa Dev

Henryk Michalewski

Xavier Garcia

Vedant Misra

Kevin Robinson

Liam Fedus

Denny Zhou

Daphne Ippolito

David Luan

Hyeontaek Lim

Barret Zoph

Alexander Spiridonov

Ryan Sepassi

David Dohan

Shivani Agrawal

Mark Omernick

Andrew M. Dai

Thanumalayan Sankaranarayana Pillai

Marie Pellat

Aitor Lewkowycz

Erica Moreira

Rewon Child

Oleksandr Polozov

Katherine Lee

Zongwei Zhou

Xuezhi Wang

Brennan Saeta

Mark Diaz

Orhan Firat

Michele Catasta

Jason Wei

Kathy Meier-Hellstern

Douglas Eck

Jeff Dean

Slav Petrov

Noah Fiedel

arxiv:2204.02311 (2022)

Finetuned Language Models are Zero-Shot Learners

Jason Wei

Maarten Paul Bosma

Vincent Zhao

Kelvin Guu

Adams Wei Yu

Brian Lester

Nan Du

Andrew Mingbo Dai

Quoc V. Le

International Conference on Learning Representations (2022)

Evaluation of US State-Based Policy Interventions on Social Distancing Using Aggregated Mobility Data during the COVID-19 Pandemic

Gregory Alexander Wellenius

Swapnil Suresh Vispute

Valeria Espinosa

Alex Fabrikant

Thomas Tsai

Jonathan Hennessy

Andrew Dai

Brian Patrick Williams

Krishna Kumar Gadepalli

Adam Boulanger

Adam Pearce

Chaitanya Kamath

Arran Schlosberg

Catherine Bendebury

Chinmoy Mandayam

Charlotte Stanton

Shailesh Bavadekar

Christopher David Pluntke

Damien Desfontaines

Benjamin H. Jacobson

Zan Armstrong

Bryant Gipson

Royce Wilson

Andrew Philip Widdowson

Katherine Chou

Andrew Nathaniel Oplinger

Tomer Shekel

Ashish K. Jha

Evgeniy Gabrilovich

Nature Communications (2021)

Training independent subnetworks for robust prediction

Marton Havasi

Rodolphe Jenatton

Stanislav Fort

Jeremiah Liu

Jasper Roland Snoek

Balaji Lakshminarayanan

Andrew Mingbo Dai

Dustin Tran

International Conference on Learning Representations (2021)

Predicting inpatient medication orders from electronic health record data

Kathryn Rough

Andrew M. Dai

Kun Zhang

Emily Xue

Laura M. Vardoulakis

Atul J. Butte

Claire Cui

Michael D. Howell

Alvin Rajkomar

Clinical Pharmacology and Therapeutics (2020)

Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description

Akim Kumok

Alex Fabrikant

Andrew Mingbo Dai

Chaitanya Kamath

Charlotte Stanton

Damien Desfontaines

Dennis Kraft

Evgeniy Gabrilovich

Gerardo Flores

Gregory Alexander Wellenius

Ilya Eckstein

Irippuge Milinda Perera

Izhak Shafran

John S. Davis

Karthik Raman

Katie Everett

Krishna Kumar Gadepalli

Masrour Zoghi

Mimi Sun

Rayman Huang

Shailesh Bavadekar

Thomas Ludwig Roessler

Venky Ramachandran

Yael Mayer

Arxiv.org, N/A (2020)

Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer

Edward Choi

Zhen Xu

Yujia Li

Michael W. Dusenberry

Gerardo Flores

Emily Xue

Andrew M. Dai

Association for the Advancement of Artificial Intelligence (AAAI) (2020)

Analyzing the Role of Model Uncertainty for Electronic Health Records

Michael W. Dusenberry

Dustin Tran

Edward Choi

Jonas Kemp

Jeremy Nixon

Ghassen Jerfel

Katherine Heller

Andrew M. Dai

ACM Conference on Health, Inference, and Learning (ACM CHIL) (2020)

Deep State-Space Generative Model For Correlated Time-to-Event Predictions

Yuan Xue

Denny Zhou

Nan Du

Andrew Mingbo Dai

Zhen Xu

Kun Zhang

Claire Cui

ACM KDD 2020

Learnability and Complexity of Quantum Samples

Murphy Yuezhen Niu

Andrew Mingbo Dai

Li Li

Augustus Odena

Zhengli Zhao

Vadim Smelyanskiy

Hartmut Neven

Sergio Boixo

arxiv (2020)

Learn to Select Trajectory Forecast Tasks for Clinical Outcome Prediction

Emily Xue

Nan Du

Anne Mottram

Martin Seneviratne

Andrew Mingbo Dai

NeurIPS 2020

Natural Questions: a Benchmark for Question Answering Research

Tom Kwiatkowski

Jennimaria Palomaki

Olivia Redfield

Michael Collins

Ankur Parikh

Chris Alberti

Danielle Epstein

Illia Polosukhin

Matthew Kelcey

Jacob Devlin

Kenton Lee

Kristina N. Toutanova

Llion Jones

Ming-Wei Chang

Andrew Dai

Jakob Uszkoreit

Quoc Le

Slav Petrov

Transactions of the Association of Computational Linguistics (2019) (to appear)

Improved Hierarchical Patient Classification with Language Model Pretraining over Clinical Notes

Jonas Beachey Kemp

Alvin Rishi Rajkomar

Andrew Mingbo Dai

NeurIPS ML4H – extended abstract (2019)

Gmail Smart Compose: Real-Time Assisted Writing

Andrew Dai

Benjamin Lee

Gagan Bansal

Jackie Tsay

Justin Lu

Mia Chen

Shuyuan Zhang

Tim Sohn

Yinan Wang

Yonghui Wu

Yuan Cao

Zhifeng Chen

KDD 2019 (2019)

Music Transformer: Generating Music with Long-Term Structure

Cheng-Zhi Anna Huang

Ashish Vaswani

Jakob Uszkoreit

Noam Shazeer

Ian Simon

Curtis Hawthorne

Andrew Dai

Matt Hoffman

Monica Dinculescu

Douglas Eck

ICLR (2019)

Embedding Text in Hyperbolic Spaces

Bhuwan Dhingra

Chris Shallue

Mohammad Norouzi

Andrew Dai

George Dahl

NAACL Workshop (2018)

Learning Longer-term Dependencies in RNNs with Auxiliary Losses

Hoang Trieu Trinh

Andrew Dai

Thang Luong

Quoc V. Le

ICML (2018)

Who said what: Modeling individual labelers improves classification

Melody Guan

Varun Gulshan

Andrew Dai

Geoffrey Hinton

AAAI (2018)

MaskGAN: Better Text Generation via Filling in the ____

William Fedus

Ian Goodfellow

Andrew Dai

ICLR (2018)

AirDialogue: An Environment for Goal-Oriented Dialogue Research

Wei Wei

Quoc V. Le

Andrew Dai

Jia Li

Empirical Methods in Natural Language Processing (EMNLP) (2018)

Many Paths to Equilibrium: GANs do not need to decrease a divergence at every step

William Fedus

Mihaela Rosca

Balaji Lakshminarayanan

Andrew Dai

Shakir Mohamed

Ian Goodfellow

ICLR (2018)

Peptide-Spectra Matching with Weak Supervision

Sam Schoenholz

Sean Hackett

Laura Deming

Eugene Melamud

Andrew Dai

Navdeep Jaitly

Fiona McAllister

Jonathon O'Brien

George Dahl

Bryson Bennett

Daphne Koller

arXiv (2018)

Scalable and accurate deep learning for electronic health records

Alvin Rishi Rajkomar

Eyal Oren

Kai Chen

Andrew Dai

Nissan Hajaj

Mila Hardt

Peter J. Liu

Xiaobing Liu

Jake Marcus

Mimi Sun

Patrik Per Sundberg

Hector Yee

Kun Zhang

Yi Zhang

Gerardo Flores

Gavin Duggan

Jamie Irvine

Quoc Le

Kurt Litsch

Alex Mossin

Justin Jesada Tansuwan

De Wang

James Wexler

Jimbo Wilson

Dana Ludwig

Samuel Volchenboum

Kat Chou

Michael Pearson

Srinivasan Madabushi

Nigam Shah

Atul Butte

Michael Howell

Claire Cui

Greg Corrado

Jeff Dean

npj Digital Medicine (2018)

HyperNetworks

David Ha

Andrew Dai

Quoc V. Le

ICLR (2017)

Adversarial Training Methods for Semi-Supervised Text Classification

Takeru Miyato

Andrew M. Dai

Ian Goodfellow

ICLR (2017)

Who Said What: Modelling Individual Labels Improves Classification

Melody Y. Guan

Varun Gulshan

Andrew M. Dai

Geoffrey Hinton

CVPR Workshop (2017)

Virtual Adversarial Training for Semi-Supervised Text Classification

Takeru Miayto

Andrew M. Dai

Ian Goodfellow

arXiv preprint (2016)

Generating Sentences from a Continuous Space

Samuel R. Bowman

Luke Vilnis

Oriol Vinyals

Andrew M. Dai

Rafal Jozefowicz

Samy Bengio

CoNLL (2016)

HyperNetworks

David Ha

Andrew Dai

Quoc Le

ICLR (2016)

Semi-supervised sequence learning

Andrew M. Dai

Quoc V. Le

Advances in Neural Information Processing Systems, NIPS (2015)

Document embedding with paragraph vectors

Andrew M. Dai

Christopher Olah

Quoc V. Le

NIPS Deep Learning Workshop (2014)

Language-independent Compound Splitting with Morphological Operations

Klaus Macherey

Andrew M. Dai

David Talbot

Ashok C. Popat

Franz Och

ACL HLT 2011, pp. 10

No Results Found

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Andrew M. Dai

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Andrew M. Dai

Research Areas

Filter by:

Year

Team

Research Area

Join us

AI/ML Foundations  & Capabilities