Kernel-Penalized Regression for Analysis of Microbiome Data

Timothy W. Randolph; Sen Zhao; Wade Copeland; Meredith Hullar; Ali Shojaie

Kernel-Penalized Regression for Analysis of Microbiome Data

Timothy W. Randolph

Sen Zhao

Wade Copeland

Meredith Hullar

Ali Shojaie

Annals of Applied Statistics, 12 (2018), pp. 540-566

Google Scholar

Abstract

The analysis of human microbiome data is often based on dimension reduced graphical displays and clusterings derived from vectors of microbial abundances in each sample. Common to these ordination methods is the use of biologically motivated definitions of similarity. Principal coordinate analysis, in particular, is often performed using ecologically defined distances, allowing analyses to incorporate context-dependent, non-Euclidean structure. In this paper, we go beyond dimension-reduced ordination methods and describe a framework of high-dimensional regression models that extends these distance-based methods. In particular, we use kernel-based methods to show how to incorporate a variety of extrinsic information, such as phylogeny, into penalized regression models that estimate taxon specific associations with a phenotype or clinical outcome. Further, we show how this regression framework can be used to address the compositional nature of multivariate predictors comprised of relative abundances; that is, vectors whose entries sum to a constant. We illustrate this approach with several simulations using data from two recent studies on gut and vaginal microbiomes. We conclude with an application to our own data, where we also incorporate a significance test for the estimated coefficients that represent associations between microbial abundance and a percent fat.

Research Areas

Algorithms and theory

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Kernel-Penalized Regression for Analysis of Microbiome Data

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs