Text-to-Text Pre-Training for Data-to-Text Tasks

Mihir Sanjay Kale; Abhinav Kumar Rastogi

Proceedings of the 13th International Conference on Natural Language Generation (INLG 2020)

Abstract

We study the pre-train + fine-tune strategy for data-to-text tasks. Our experiments indicate that text-to-text pre-training in the form of T5 (Raffel et al., 2019) enables simple, end-to-end Transformer-based models to outperform pipelined neural architectures tailored for data-to-text generation, as well as alternatives such as BERT and GPT-2. Importantly, T5 pre-training leads to better generalization, as evidenced by large improvements on out-of-domain test sets. We hope our work serves as a useful baseline for future research, as transfer learning becomes ever more prevalent for data-to-text tasks.
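The abstract does not say how structured data is presented to a text-to-text model. One common option, assumed here purely for illustration, is to flatten each record into a single string that an end-to-end model can take as its input. The sketch below shows that idea for (subject, predicate, object) triples; the function name `linearize_triples`, the `<S>`/`<P>`/`<O>` markers, and the sample data are all hypothetical and are not taken from the paper.

```python
from typing import Iterable, Tuple

# A triple is (subject, predicate, object). This representation is an
# assumption for illustration, not the paper's stated input format.
Triple = Tuple[str, str, str]


def linearize_triples(triples: Iterable[Triple]) -> str:
    """Flatten triples into one string usable as text-to-text model input."""
    parts = []
    for subject, predicate, obj in triples:
        # <S>, <P>, <O> are hypothetical plain-text delimiters.
        parts.append(f"<S> {subject} <P> {predicate} <O> {obj}")
    return " ".join(parts)


# Made-up sample record, for illustration only.
example = [
    ("Blue Spice", "food", "Italian"),
    ("Blue Spice", "area", "riverside"),
]
source_text = linearize_triples(example)
print(source_text)
```

Under this setup, fine-tuning would pair each flattened string with its reference description, so the model sees plain text on both the input and output side.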

Research Areas

Natural language processing
