December 3, 2025
From Waveforms to Wisdom: The New Benchmark for Auditory IntelligenceOctober 7, 2025
Speech-to-Retrieval (S2R): A new approach to voice searchJuly 2, 2025
Making group conversations more accessible with sound localizationMarch 21, 2025
Deciphering language processing in the human brain through LLM representationsJanuary 16, 2025
Zero-shot mono-to-binaural speech synthesisAugust 21, 2024
Restoring speaker voices with zero-shot cross-lingual voice transfer for TTSJuly 9, 2024
Assessing ASR performance with meaning preservationApril 17, 2024
Robust speech recognition in AR through infinite virtual rooms with acoustic modelingDecember 1, 2023
Unsupervised speech-to-speech translation from monolingual dataOctober 26, 2023
Spoken question answering and speech continuation using a spectrogram-powered LLMOctober 19, 2023
English learners can now practice speaking on SearchJune 22, 2023
SoundStorm: Efficient parallel audio generation