Speech Processing

December 3, 2025
From Waveforms to Wisdom: The New Benchmark for Auditory Intelligence
- Machine Intelligence ·
- Sound & Accoustics ·
- Speech Processing
October 7, 2025
Speech-to-Retrieval (S2R): A new approach to voice search
- Machine Intelligence ·
- Natural Language Processing ·
- Product ·
- Speech Processing
July 2, 2025
Making group conversations more accessible with sound localization
- Human-Computer Interaction and Visualization ·
- Sound & Accoustics ·
- Speech Processing
March 21, 2025
Deciphering language processing in the human brain through LLM representations
- General Science ·
- Health & Bioscience ·
- Natural Language Processing ·
- Speech Processing
January 16, 2025
Zero-shot mono-to-binaural speech synthesis
- Sound & Accoustics ·
- Speech Processing
August 21, 2024
Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
- Human-Computer Interaction and Visualization ·
- Machine Translation ·
- Speech Processing
July 9, 2024
Assessing ASR performance with meaning preservation
- Responsible AI ·
- Speech Processing
April 17, 2024
Robust speech recognition in AR through infinite virtual rooms with acoustic modeling
- Human-Computer Interaction and Visualization ·
- Machine Perception ·
- Speech Processing
December 1, 2023
Unsupervised speech-to-speech translation from monolingual data
- Machine Translation ·
- Product ·
- Speech Processing
October 26, 2023
Spoken question answering and speech continuation using a spectrogram-powered LLM
- Natural Language Processing ·
- Speech Processing
October 19, 2023
English learners can now practice speaking on Search
- Education Innovation ·
- Product ·
- Speech Processing
June 22, 2023
SoundStorm: Efficient parallel audio generation
- Sound & Accoustics ·
- Speech Processing