Google Research

Crowdsourced high-quality Argentinian Spanish [es-ar] speech multi-speaker dataset.


This dataset was collected for speech technology research.

This dataset contains transcribed high-quality (48kHz, 16 bit, mono, Wave audio) audio Spanish sentences recorded by volunteers in Buenos Aires, Argentina.

The dataset also contains recordings of simple weather messages recorded in Argentinian Spanish (90 messages), and Peninsular Spanish (90 messages).

Some quality checks have been done on the data, but there might still be mistranscriptions or artifacts in the audio.