Variance Reduction in Bipartite Experiments through Correlation Clustering

Jean Pouget-Abadie; Kevin Aydin; Warren Schudy; Kay Brodersen; Vahab Mirrokni

Variance Reduction in Bipartite Experiments through Correlation Clustering

Jean Pouget-Abadie

Kevin Aydin

Warren Schudy

Kay Brodersen

Vahab Mirrokni

Thirty-third Conference on Neural Information Processing Systems (2019) (to appear)

Google Scholar

Abstract

Causal inference in randomized experiments typically assumes that the units of randomization and the units of analysis are one and the same. In some applications, however, these two roles are played by distinct entities linked by a bipartite graph. The key challenge in such bipartite settings is how to avoid interference bias, which would typically arise if we simply randomized the treatment at the level of analysis units. One effective way of minimizing interference bias in standard experiments is through cluster randomization, but this design has not been studied in the bipartite setting where conventional clustering schemes can lead to poorly powered experiments. This paper introduces a novel clustering objective and a corresponding algorithm that partitions a bipartite graph so as to maximize the statistical power of a bipartite experiment on that graph. Whereas previous work relied on balanced partitioning, our formulation suggests the use of a correlation clustering objective. We use a publicly-available graph of Amazon user-item reviews to validate our solution and illustrate how it substantially increases the statistical power in bipartite experiments.

Research Areas

Algorithms and theory

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Variance Reduction in Bipartite Experiments through Correlation Clustering

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs