Speech Intelligibility Classifiers from 550k Disordered Speech Samples

Subhashini Venugopalan

Jimmy Tobin

Samuel Yang

Katie Seaver

Richard Cave

Pan-Pan Jiang

Neil Zeghidour

Rus Heywood

Jordan Green

Michael Brenner

ICASSP, Icassp submission. 2022 (2023)

Download Google Scholar

Abstract

We developed dysarthric speech intelligibility classifiers on 551,176 disordered speech samples contributed by a diverse set of 468 speakers, with a range of self-reported speaking disorders and rated for their overall intelligibility on a fivepoint scale. We trained three models following different deep learning approaches and evaluated them on ∼94K utterances from 100 speakers. We further found the models to generalize well (without further training) on the TORGO database (100% accuracy), UASpeech (0.93 correlation), ALS-TDI PMP (0.81 AUC) datasets as well as on a dataset of realistic unprompted speech we gathered (106 dysarthric and 76 control speakers, ∼2300 samples).

Research Areas

Machine Intelligence

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Speech Intelligibility Classifiers from 550k Disordered Speech Samples

Abstract

Research Areas

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Speech Intelligibility Classifiers from 550k Disordered Speech Samples

Abstract

Research Areas

Learn more about how we conduct our research

AI/ML Foundations  & Capabilities