
Georg Heigold
Georg Heigold received the Diplom degree in
physics from ETH Zurich, Switzerland, in 2000.
He was a Software Engineer at De La Rue, Berne,
Switzerland, from 2000 to 2003. From 2004 to 2010,
he was with the Computer Science Department,
RWTH Aachen University, Aachen, Germany.
Since 2010, he has been a Research Scientist at
Google, Mountain View, CA. His research interests
include automatic speech recognition, discriminative
training, and log-linear modeling.
Authored Publications
Conditional Object-Centric Learning from Video
Thomas Kipf
Gamaleldin Fathy Elsayed
Austin Stone
Rico Jonschkowski
Alexey Dosovitskiy
Klaus Greff
ICLR (2022)
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexander Kolesnikov
Alexey Dosovitskiy
Dirk Weissenborn
Jakob Uszkoreit
Lucas Beyer
Matthias Minderer
Neil Houlsby
Sylvain Gelly
Thomas Unterthiner
Xiaohua Zhai
ICLR (2021)
Object-Centric Learning with Slot Attention
Francesco Locatello
Dirk Weissenborn
Thomas Unterthiner
Jakob Uszkoreit
Alexey Dosovitskiy
Thomas Kipf
NeurIPS (2020)
End-to-End Text-Dependent Speaker Verification
Samy Bengio
Noam M. Shazeer
International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE (2016)
Asynchronous, Online, GMM-free Training of a Context Dependent Acoustic Model for Speech Recognition
Proceedings of the European Conference on Speech Communication and Technology (2014) (to appear)
Asynchronous Stochastic Optimization for Sequence Training of Deep Neural Networks
Erik McDermott
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE, Firenze, Italy (2014)
GMM-Free DNN Training
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (2014)
Sequence Discriminative Distributed Training of Long Short-Term Memory Recurrent Neural Networks
Andrew Senior
Erik McDermott
Rajat Monga
Mark Mao
Interspeech (2014)
Word Embeddings for Speech Recognition
Samy Bengio
Proceedings of the 15th Conference of the International Speech Communication Association, Interspeech (2014)