XTREME-S: Evaluating Cross-lingual Speech Representations

Ankur Bapna

Clara E. Rivera

Daan van Esch

Jason Riesa

Jon Clark

Melvin Johnson

Mihir Sanjay Kale

Min Ma

Orhan Firat

Sandy Ritchie

Sebastian Ruder

Simran Khanuja

Ye Jia

Yu Zhang

Proc. Interspeech 2022

Google Scholar

Abstract

We introduce \xtremes, a new benchmark to evaluate universal cross-lingual speech representations in many languages. XTREME-S covers four task families: speech recognition, classification, retrieval and speech-to-text translation. Covering 102 languages from 10+ language families, 3 different domains and 4 task families, XTREME-S aims to simplify multilingual speech representation evaluation, as well as catalyze research in ``universal'' speech representation learning. This paper describes the new benchmark and establishes the first speech-only and speech-text baselines using XLS-R and mSLAM on all downstream tasks. We motivate the design choices and detail how to use the benchmark. The code and pre-processing scripts will be made publicly available.\footnote{\small\url{https://huggingface.co/datasets/google/xtreme_s}}

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

XTREME-S: Evaluating Cross-lingual Speech Representations

Abstract

Research Areas

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

XTREME-S: Evaluating Cross-lingual Speech Representations

Abstract

Research Areas

Learn more about how we conduct our research

AI/ML Foundations  & Capabilities