Stochastic Retrieval-Conditioned Reranking

Hamed Zamani; Mike Bendersky; Don Metzler; Honglei Zhuang; Xuanhui Wang

Stochastic Retrieval-Conditioned Reranking

Hamed Zamani

Mike Bendersky

Don Metzler

Honglei Zhuang

Xuanhui Wang

The ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR) 2022

Download Google Scholar

Abstract

The multi-stage cascaded architecture has been adopted by many search engines for efficient and effective retrieval. This architecture consists of a stack of retrieval and reranking models in which efficient retrieval models are followed by effective (neural) learning to rank models. The optimization of these learning to rank models is loosely connected to the early stage retrieval models. In many cases these learning to rank models are often trained in isolation of the early stage retrieval models. This paper draws theoretical connections between the early stage retrieval and late stage reranking models by deriving expected reranking performance conditioned on the early stage retrieval results. Our findings shed light on optimization of both retrieval and reranking models. As a result, we also introduce a novel loss function for training reranking models that leads to significant improvement in multiple public benchmarks.

Research Areas

Information retrieval

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Stochastic Retrieval-Conditioned Reranking

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs