Separate And Attend in Personal Email Search

Yu Meng

Maryam Karimzadehgan

Honglei Zhuang

Don Metzler

Proceedings of the 13th ACM International Conference on Web Search and Data Mining (WSDM) (2020)

Download Google Scholar

Abstract

In personal email search, user queries often impose different requirements on different aspects of the retrieved emails. For example, the query "my recent flight to the US'" requires emails to be ranked based on both textual contents and recency of the email documents, while other queries such as "medical history'" do not impose any constraints on the recency of the email. Recent deep learning-to-rank models for personal email search often directly concatenate dense numerical features with embedded sparse features (e.g, n-gram embeddings). In this paper, we first show with a set of experiments on synthetic datasets that direct concatenation of dense and sparse features does not lead to the optimal search performance of deep neural ranking models.
To effectively incorporate both sparse and dense email features into personal email search ranking, we propose a novel neural model, sepattn. sepattn first builds two separate neural models to learn from sparse and dense features respectively, and then applies an attention mechanism at the prediction level to derive the final prediction from these two models.
We conduct a comprehensive set of experiments on a large-scale email search dataset, and demonstrate that our sepattn model consistently improves the search quality over the baseline models.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Separate And Attend in Personal Email Search

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs