Zero-shot Entity Linking by Reading Entity Descriptions

Lajanugen Logeswaran

Ming-Wei Chang

Kenton Lee

Kristina N. Toutanova

Jacob Devlin

Honglak Lee

ACL 2019

Download Google Scholar

Abstract

We present the zero-shot entity linking task, where mentions must be linked to unseen entities without in-domain labeled data. The goal is to enable robust transfer to highly specialized domains, and so no metadata or alias tables are assumed. In this setting, entities are only identified by text descriptions, and models must rely strictly on language understanding to resolve the new entities. First, we show that strong reading comprehension models pretrained on large unlabeled data can be used to generalize to unseen entities. Second, we propose a simple and effective adaptive pretraining strategy, which we term domain-adaptive pretraining (DAP), to address the domain shift problem associated with linking unseen entities in a new domain. We present experiments on a new dataset that we construct for this task and show that DAP improves over strong pretraining baselines, including BERT. The data and code are available at https://github.com/lajanugen/zeshel.

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Zero-shot Entity Linking by Reading Entity Descriptions

Abstract

Research Areas

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Zero-shot Entity Linking by Reading Entity Descriptions

Abstract

Research Areas

Learn more about how we conduct our research

AI/ML Foundations  & Capabilities