SEAHORSE: A Dataset of Summaries Annotated with Human Ratings in Six Languages

Elizabeth Clark; Shruti Rijhwani; Sebastian Gehrmann; Joshua Maynez; Roee Aharoni; Vitaly Nikolaev; Thibault Sellam; Aditya Siddhant; Dipanjan Das; Ankur Parikh

SEAHORSE: A Dataset of Summaries Annotated with Human Ratings in Six Languages

Elizabeth Clark

Shruti Rijhwani

Sebastian Gehrmann

Joshua Maynez

Roee Aharoni

Vitaly Nikolaev

Thibault Sellam

Aditya Siddhant

Dipanjan Das

Ankur Parikh

EMNLP 2023, Association for Computational Linguistics (2023)

Download Google Scholar

Abstract

We introduce Seahorse (SummariEs Annotated with Human Ratings in Six languagEs), a dataset of 96K summaries with ratings along 6 dimensions (comprehensibility, repetition, grammar, attribution, main idea(s), and conciseness). The summaries are generated from 8 different models, conditioned on source text from 4 datasets in 6 languages (German, English, Spanish, Russian, Turkish, and Vietnamese). We release the annotated summaries as a resource for developing better summarization models and automatic metrics. We present an analysis of the dataset's composition and quality, and we demonstrate the potential of this dataset for building better summarization metrics, showing that metrics finetuned with Seahorse data outperform baseline metrics.

Research Areas

Natural language processing

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

SEAHORSE: A Dataset of Summaries Annotated with Human Ratings in Six Languages

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs