Disordered Speech Data Collection: Lessons Learned at 1 Million Utterances from Project Euphonia

Bob MacDonald; Pan-Pan Jiang; Julie Cattiau; Rus Heywood; Richard Cave; Katie Seaver; Marilyn Ladewig; Jimmy Tobin; Michael Brenner; Philip Q Nelson; Jordan R. Green; Katrin Tomanek

Disordered Speech Data Collection: Lessons Learned at 1 Million Utterances from Project Euphonia

Bob MacDonald

Pan-Pan Jiang

Julie Cattiau

Rus Heywood

Richard Cave

Katie Seaver

Marilyn Ladewig

Jimmy Tobin

Michael Brenner

Philip Q Nelson

Jordan R. Green

Katrin Tomanek

Interspeech (2021) (to appear)

Google Scholar

Abstract

Speech samples from over 1000 individuals with impaired speech have been submitted for Project Euphonia, aimed at improving automated speech recognition for atypical speech. We provide an update on the contents of the corpus, which recently passed 1 million utterances, and review key lessons learned from this project.
The reasoning behind decisions such as phrase set composition, prompted vs extemporaneous speech, metadata and data quality efforts are explained based on findings from both technical and user-facing research.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Disordered Speech Data Collection: Lessons Learned at 1 Million Utterances from Project Euphonia

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs