Daisy Stanton

Generative semi-supervised learning with a neural seq2seq noisy channel

Soroosh Mariooryad

Matt Shannon

Siyuan Ma

Tom Bagby

David Kao

Daisy Stanton

Eric Battenberg

RJ Skerry-Ryan

ICML Workshop on Structured Probabilistic Inference(2023)

Speaker Generation

Daisy Stanton

David Teh-Hwa Kao

Eric Battenberg

Matt Shannon

RJ Skerry-Ryan

Soroosh Mariooryad

Tom Bagby

ICASSP(2022)

Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Eric Battenberg

RJ Skerry-Ryan

Soroosh Mariooryad

Daisy Stanton

David Kao

Matt Shannon

Tom Bagby

ICASSP(2020)

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

Raza Habib

Soroosh Mariooryad

Matt Shannon

Eric Battenberg

RJ Skerry-Ryan

Daisy Stanton

David Kao

Tom Bagby

ICLR(2019)

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Eric Battenberg

Soroosh Mariooryad

Daisy Stanton

RJ Skerry-Ryan

Matt Shannon

David Kao

Tom Bagby

arXiv(2019)

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

RJ Skerry-Ryan

Eric Battenberg

Ying Xiao

Yuxuan Wang

Daisy Stanton

Joel Shor

Ron J. Weiss

Rob Clark

Rif A. Saurous

International Conference on Machine Learning(2018)

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Yuxuan Wang

Daisy Stanton

Yu Zhang

RJ Skerry-Ryan

Eric Battenberg

Joel Shor

Ying Xiao

Fei Ren

Ye Jia

Rif A. Saurous

ICML(2018)

Tacotron: Towards End-to-End Speech Synthesis

Yuxuan Wang

RJ Skerry-Ryan

Daisy Stanton

Yonghui Wu

Ron J. Weiss

Navdeep Jaitly

Zongheng Yang

Ying Xiao

Zhifeng Chen

Samy Bengio

Quoc Le

Yannis Agiomyrgiannakis

Rob Clark

Rif A. Saurous

Interspeech(2017)

Uncovering Latent Style Factors for Expressive Speech Synthesis

Yuxuan Wang

RJ Skerry-Ryan

Ying Xiao

Daisy Stanton

Joel Shor

Eric Battenberg

Rob Clark

Rif A. Saurous

NIPS Workshop on Machine Learning for Audio Signal Processing (ML4Audio)(2017) (to appear)

Fix It Where It Fails: Pronunciation Learning by Mining Error Corrections from Speech Logs

Zhenzhen Kou

Daisy Stanton

Fuchun Peng

Françoise Beaufays

Trevor Strohman

ICASSP(2015)

Preview

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Daisy Stanton

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Daisy Stanton

Research Areas

Filter by:

Publications

Years

Research Areas

Teams

Join us

AI/ML Foundations  & Capabilities