Probabilistic Retrieval Based On Document Representations

Wolfgang Macherey; Joerg Viechtbauer; Hermann Ney

Probabilistic Retrieval Based On Document Representations

Wolfgang Macherey

Joerg Viechtbauer

Hermann Ney

Int. Conf. on Spoken Language Processing (2002), pp. 1481-1484

Download Google Scholar

Abstract

Accessing information in multimedia databases encompasses a wide range of applications in which spoken document retrieval (SDR) plays an important role. In the recent past, research increasingly focused on the development of heuristic and probabilistic retrieval metrics that are suitable for retrieving spoken documents. So far, many heuristic retrieval metrics, eg the SMART-2 metric, have been proven to be more efficient than most advanced statistical approaches to SDR. In this paper, we propose a new probabilistic approach that is based on interpolations between document representations. This approach can be interpreted as a sort of nearest neighbor concept between documents, where a query is treated as a document. Experiments performed on the TREC-7 and TREC-8 SDR task show comparable or even better results than the SMART-2 metric.

Research Areas

Information retrieval

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Probabilistic Retrieval Based On Document Representations

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs