A Grammar of Data Analysis

Taylor Pospisil
Omkar Muralidharan
Dennis Sun
arXiv (2025)

Abstract

This paper outlines a grammar of data analysis, as distinct from grammars of data manipulation. The primitives of this grammar are metrics and dimensions. We describe Meterstick, a Python implementation of this grammar that is agnostic to the underlying data source, which may be a DataFrame or a SQL database.