
W. Ronny Huang
Ronny is a research scientist focused on building robust and generalizable algorithms for speech and language data. He holds MS and PhD degrees in electrical engineering from MIT, where he demonstrated the first handheld laser-driven particle accelerator.
Authored Publications
E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
David Rybach
Cal Peyser
Zhiyun Lu
Interspeech 2022 (2022) (to appear)
Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Cal Peyser
Ruoming Pang
Submitted to Interspeech 2022 (2022) (to appear)
Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model
You-Chi Cheng
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2022, Virtual and Singapore, 23-27 May 2022, IEEE, pp. 6097-6101
Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Cal Peyser
David Johannes Rybach
Interspeech (2021) (to appear)
GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training
Chen Zhu
Renkun Ni
Kezhi Kong
Tom Goldstein
Conference on Neural Information Processing Systems (NeurIPS) (2021) (to appear)
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling
Rami Botros
Ruoming Pang
David Johannes Rybach
James Qin
Quoc-Nam Le-The
Anmol Gulati
Cal Peyser
Chung-Cheng Chiu
Emmanuel Guzman
Jiahui Yu
Qiao Liang
Wei Li
Yonghui Wu
Yu Zhang
Interspeech (2021) (to appear)