An AI system to help scientists write expert-level empirical software

Eser Aygün; Anastasiya Belyaeva; Gheorghe Comanici; Marc Coram; Hao Cui; Jake Garrison; Renee Johnston; Anton Kast; Cory McLean; Peter Norgaard; Zahra Shamsi; David Smalling; James Thompson; Subhashini Venugopalan; Brian Williams; Sarah Martinson; Martyna Plomecka; Lai Wei; Yuchen Zhou; Qian-Ze Zhu; Matthew Abraham; Erica Brand; Anna Bulanova; Jeffrey Cardille; Chris Co; Scott Ellsworth; Grace Joseph; Malcolm Kane; Ryan Krueger; Johan Kartiwa; Dan Liebling; Jackson Cui; Jan-Matthis Lückmann; Paul Raccuglia; Julie Wang; Kat Chou; James Manyika; Yossi Matias; John Platt; Lizzie Dorfman; Shibl Mourad; Michael Brenner

An AI system to help scientists write expert-level empirical software

Eser Aygün

Anastasiya Belyaeva

Gheorghe Comanici

Marc Coram

Hao Cui

Jake Garrison

Renee Johnston

Anton Kast

Cory McLean

Peter Norgaard

Zahra Shamsi

David Smalling

James Thompson

Subhashini Venugopalan

Brian Williams

Sarah Martinson

Martyna Plomecka

Lai Wei

Yuchen Zhou

Qian-Ze Zhu

Matthew Abraham

Erica Brand

Anna Bulanova

Jeffrey Cardille

Chris Co

Scott Ellsworth

Grace Joseph

Malcolm Kane

Ryan Krueger

Johan Kartiwa

Dan Liebling

Jackson Cui

Jan-Matthis Lückmann

Paul Raccuglia

Julie Wang

Kat Chou

James Manyika

Yossi Matias

John Platt

Lizzie Dorfman

Shibl Mourad

Michael Brenner

Nature (2026)

Download Google Scholar

Abstract

The cycle of scientific discovery is frequently bottlenecked by the slow, manual creation of software to support computational experiments. To address this, we present Empirical Research Assistance (ERA), an AI system that creates expert-level scientific software whose goal is to maximize a quality metric. The system uses a Large Language Model (LLM) and Tree Search (TS) to systematically improve the quality metric and intelligently navigate the large space of possible solutions. ERA achieves expert-level results when it explores and integrates complex research ideas from external sources. The effectiveness of tree search is demonstrated across a diverse range of tasks. In bioinformatics, ERA discovered 40 novel methods for single-cell data analysis that outperformed the top human-developed methods on a public leaderboard. In epidemiology, ERA generated 14 models that outperformed the CDC ensemble and all other individual models for forecasting COVID-19 hospitalizations. ERA also produced expert-level software for geospatial analysis, neural activity prediction in zebrafish, and numerical solution of integrals, and a novel rule-based construction for time series forecasting. By devising and implementing novel solutions to diverse tasks, ERA represents a significant step towards accelerating scientific progress.

Keywords: Tree Search, Generative AI, Scorable Scientific Tasks, Empirical Software

Research Areas

Machine intelligence

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

An AI system to help scientists write expert-level empirical software

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs