Exploring Unexplored Generalization Challenges for Cross-Database Semantic Parsing

Alane Laughlin Suhr; Kenton Lee; Ming-Wei Chang; Pete Shaw

ACL 2020

Abstract

We study the task of cross-database semantic parsing (XSP), where a system that maps natural language utterances to executable SQL queries is evaluated on databases unseen during training. Recently, several datasets, including Spider, were proposed to support development of XSP systems. We propose a challenging evaluation setup for cross-database semantic parsing, focusing on variation across database schemas and in-domain language use. We re-purpose eight semantic parsing datasets that have been well-studied in the setting where in-domain training data is available, and use them instead as additional evaluation data for XSP systems. We build a system that performs well on Spider, and find that it struggles to generalize to our re-purposed set. Our setup uncovers several generalization challenges for cross-database semantic parsing, demonstrating the need to use and develop diverse training and evaluation datasets.

Research Areas

Natural language processing
