Sandy Ritchie

Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages

Sebastian Ruder

Julia Kreutzer

Clara Rivera

Ishank Saxena

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Multimodal Modeling for Spoken Language Identification

Shikhar Bharadwaj

Min Ma

Shikhar Vashishth

Ankur Bapna

Sriram (Sri) Ganapathy

Vera Axelrod

Sid Dalmia

Wei Han

Yu Zhang

Daan van Esch

Sandy Ritchie

Partha Talukdar

Jason Riesa

Proceedings of the 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)

LinguaMeta: Unified Metadata for Thousands of Languages

Sandy Ritchie

Daan van Esch

Uche Okonkwo

Shikhar Vashishth

Emily Drummond

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Chimane-Mosetén

Sandy Ritchie

Jeanette Sakel

Amazonian Languages: An International Handbook, De Gruyter Mouton (2023)

Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning

Sandy Ritchie

You-Chi Cheng

Mingqing Chen

Rajiv Mathews

Daan van Esch

Bo Li

Khe Chai Sim

(2022)

XTREME-S: Evaluating Cross-lingual Speech Representations

Ankur Bapna

Clara E. Rivera

Daan van Esch

Jason Riesa

Jon Clark

Melvin Johnson

Mihir Sanjay Kale

Min Ma

Orhan Firat

Sandy Ritchie

Sebastian Ruder

Simran Khanuja

Ye Jia

Yu Zhang

Proceedings of Interspeech 2022

Text Normalization for Low-Resource Languages of Africa

Andrew Zupon

Evan Elizabeth Crew

Sandy Ritchie

AfricaNLP (2021)

A Large Scale Low-Resource Pronunciation Data Set Mined From Wikipedia

Tania Chakraborty

Manasa Prasad

Theresa Breiner

Sandy Ritchie

Daan van Esch

arXiv cs.CL (2021)

Data-Driven Parametric Text Normalization: Rapidly Scaling Finite-State Transduction Verbalizers to New Languages

Sandy Ritchie

Eoin Mahon

Kim Anne Heiligenstein

Nikos Bampounis

Daan van Esch

Christian Schallhart

Jonas Fromseier Mortensen

Benoit Brard

Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), Language Resources and Evaluation Conference (LREC 2020), Marseille, 218–225

Unified Verbalization for Speech Recognition & Synthesis Across Languages

Sandy Ritchie

Richard Sproat

Kyle Gorman

Daan van Esch

Christian Schallhart

Nikos Bampounis

Benoit Brard

Jonas Fromseier Mortensen

Millie Holt

Eoin Mahon

Proceedings of Interspeech 2019
