Unified Verbalization for Speech Recognition & Synthesis Across Languages

Sandy Ritchie

Richard Sproat

Kyle Gorman

Daan van Esch

Christian Schallhart

Nikos Bampounis

Benoit Brard

Jonas Fromseier Mortensen

Millie Holt

Eoin Mahon

Proceedings of Interspeech 2019

Download Google Scholar

Abstract

We describe a new approach to converting written tokens to their spoken form, which can be used across automatic speech recognition (ASR) and text-to-speech synthesis (TTS) systems. Both ASR and TTS systems need to map from the written to the spoken domain, and we present an approach that enables us to share verbalization grammars between the two systems. We also describe improvements to an induction system for number name grammars. Between these shared ASR/TTS verbalization systems and the improved induction system for number name grammars, we see significant gains in development time and scalability across languages

Research Areas

Speech Processing
Natural Language Processing

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Unified Verbalization for Speech Recognition & Synthesis Across Languages

Abstract

Research Areas

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Unified Verbalization for Speech Recognition & Synthesis Across Languages

Abstract

Research Areas

Learn more about how we conduct our research

AI/ML Foundations  & Capabilities