Speech Processing
-
January 16, 2025
Zero-shot mono-to-binaural speech synthesis
- Sound & Acoustics
- Speech Processing
-
August 21, 2024
Restoring speaker voices with zero-shot cross-lingual voice transfer for TTS
- Human-Computer Interaction and Visualization
- Machine Translation
- Speech Processing
-
July 9, 2024
Assessing ASR performance with meaning preservation
- Responsible AI
- Speech Processing
-
April 17, 2024
Robust speech recognition in AR through infinite virtual rooms with acoustic modeling
- Human-Computer Interaction and Visualization
- Machine Perception
- Speech Processing
-
December 1, 2023
Unsupervised speech-to-speech translation from monolingual data
- Machine Translation
- Product
- Speech Processing
-
October 26, 2023
Spoken question answering and speech continuation using a spectrogram-powered LLM
- Natural Language Processing
- Speech Processing
-
October 19, 2023
English learners can now practice speaking on Search
- Education Innovation
- Product
- Speech Processing
-
June 22, 2023
SoundStorm: Efficient parallel audio generation
- Sound & Acoustics
- Speech Processing
-
June 21, 2023
Responsible AI at Google Research: AI for Social Good
- Human-Computer Interaction and Visualization
- RAI-HCT Highlights
- Speech Processing
-
June 7, 2023
Evaluating speech synthesis in many languages with SQuId
- Conferences & Events
- Speech Processing
-
June 2, 2023
AVFormer: Injecting vision into frozen speech models for zero-shot AV-ASR
- Machine Intelligence
- Speech Processing
-
March 6, 2023
Universal Speech Model (USM): State-of-the-art speech AI for 100+ languages
- Speech Processing