Scott Wisdom

TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition

Hakan Erdogan

Scott Wisdom

Xuankai Chang

Zalán Borsos

Marco Tagliasacchi

Neil Zeghidour

John Hershey

Interspeech 2023

AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation

Efthymios Tzinis

Scott Wisdom

Tal Remez

John Hershey

European Conference on Computer Vision (ECCV)(2022)

Improving Bird Classification with Unsupervised Sound Separation

Tom Denton

Scott Wisdom

John Hershey

ICASSP 2022

Don’t Listen to What You Can’t See: The Importance of Negative Examples for Audio-Visual On-Screen Sound Separation

Efthymios Tzinis

Scott Wisdom

John Hershey

ECCV 2022 Workshop on AV4D: Visual Learning of Sounds in Spaces

Listening with Googlears: Low-Latency Neural Multiframe Beamforming and Equalization for Hearing Aids

Samuel Yang

Scott Wisdom

Chet Gnegy

Richard F. Lyon

Sagar Savla

Interspeech 2022(2022)

Adapting Speech Separation Systems to Real-World Meetings Using Mixture Invariant Training

Aswin Sivaraman

Scott Wisdom

Hakan Erdogan

John Hershey

ICASSP 2022

Sparse, Efficient, and Semantic MixIT: Taming In-the-Wild Unsupervised Sound Separation

Scott Wisdom

Aren Jansen

Ron J. Weiss

Hakan Erdogan

John Hershey

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)(2021)

What's All the FUSS About Free Universal Sound Separation Data?

Scott Wisdom

Hakan Erdogan

Dan Ellis

Romain Serizel

Nicolas Turpault

Eduardo Fonseca

Justin Salamon

Prem Seetharaman

John Hershey

ICASSP 2021

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

Yuma Koizumi

Shigeki Karita

Scott Wisdom

Hakan Erdogan

John Hershey

Lion Jones

Michiel Adriaan Unico Bacchiani

Proc. IEEE Workshop Appl. Signal Process. Audio Acoust. (WASPAA)(2021)

Self-Supervised Learning from Automatically Separated Sound Scenes

Eduardo Fonseca

Aren Jansen

Dan Ellis

Scott Wisdom

Marco Tagliasacchi

John Hershey

Manoj Plakal

Shawn Hershey

R. Channing Moore

Xavier Serra

WASPAA 2021(2021)

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Scott Wisdom

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Scott Wisdom

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us

AI/ML Foundations  & Capabilities