What’s your ML test score? A rubric for ML production systems

Eric Breck; Shanqing Cai; Eric Nielsen; Michael Salib; D. Sculley

What’s your ML test score? A rubric for ML production systems

Eric Breck

Shanqing Cai

Eric Nielsen

Michael Salib

D. Sculley

Reliable Machine Learning in the Wild - NIPS 2016 Workshop (2016)

Google Scholar

Abstract

Using machine learning in real-world production systems is complicated by a
host of issues not found in small toy examples or even large offline research
experiments. Testing and monitoring are key considerations for assessing the
production-readiness of an ML system. But how much testing and monitoring is
enough? We present an ML Test Score rubric based on a set of actionable tests to
help quantify these issues.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

What’s your ML test score? A rubric for ML production systems

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs