Information Retrieval and the Web

The science surrounding search engines is commonly referred to as information retrieval, in which algorithmic principles are developed to match user interests to the best information about those interests.

Google started as a result of our founders' attempt to find the best matching between the user queries and Web documents, and do it really fast. During the process, they uncovered a few basic principles: 1) best pages tend to be those linked to the most; 2) best description of a page is often derived from the anchor text associated with the links to a page. Theories were developed to exploit these principles to optimize the task of retrieving the best documents for a user query.

Search and Information Retrieval on the Web has advanced significantly from those early days: 1) the notion of ""information"" has greatly expanded from documents to much richer representations such as images, videos, etc., 2) users are increasingly searching on their Mobile devices with very different interaction characteristics from search on the Desktops; 3) users are increasingly looking for direct information, such as answers to a question, or seeking to complete tasks, such as appointment booking. Through our research, we are continuing to enhance and refine the world's foremost search engine by aiming to scientifically understand the implications of those changes and address new challenges that they bring.

Recent Publications

Websites Need Your Permission Too – User Sentiment and Decision Making on Web Permission Prompts in Desktop Chrome

Marian Harbach

CHI 2024, ACM (to appear)

Don’t Interrupt Me – A Large-Scale Study of On-Device Permission Prompt Quieting in Chrome

Marian Harbach

Igor Bilogrevic

Enrico Bacis

Serena Chen

Ravjit Uppal

Andy Paicu

Elias Klim

Meggyn Watkins

Balazs Engedy

(2024)

VRDU: A Benchmark for Visually-rich Document Understanding

Zilong Wang

Yichao Zhou

Wei Wei

Chen-Yu Lee

Sandeep Tata

2023 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conversational Recommendation as Retrieval: A Simple, Strong Baseline

Raghav Gupta

Renat Aksitov

Samrat Phatale

Simral Chaudhary

Harrison Lee

Abhinav Rastogi

5th Workshop on NLP for Conversational AI (2023)

Automating Nearest Neighbor Search Configuration with Constrained Optimization

Phil Sun

Ruiqi Guo

Sanjiv Kumar

International Conference on Learning Representations (2023)

HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting

Jiaying Lu

Jiaming Shen

Bo Xiong

Wenjing Ma

Steffen Staab

Carl Yang

Proc. of The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (2023)

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Information Retrieval and the Web

Recent Publications

Some of our teams

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Information Retrieval and the Web

Recent Publications

Some of our teams

Join us

AI/ML Foundations  & Capabilities