Google Research

Crowdsourced Nepali [ne-np] ASR dataset.


This dataset was collected for speech technology research from native Nepali speakers who volunteered to supply the data. The audio was recorded on standard consumer smartphones in various environments. It is delivered in a downsampled lossless format (16 kHz, 16-bit, mono FLAC).
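
As a minimal sketch of how the stated format could be checked, the Python snippet below parses the STREAMINFO metadata block at the start of a FLAC stream and compares it with the values given above (16 kHz, 16-bit, mono). The function names `read_flac_streaminfo` and `matches_stated_format` are hypothetical helpers introduced for illustration and are not part of this dataset. The snippet assumes STREAMINFO is the first metadata block, as the FLAC format requires, and uses only the standard library.

```python
import struct

def read_flac_streaminfo(data: bytes) -> dict:
    """Parse sample rate, channels, bit depth and sample count from a FLAC header.

    `data` must hold at least the first 42 bytes of a FLAC stream.
    Hypothetical helper for illustration, not shipped with the dataset.
    """
    if len(data) < 42:
        raise ValueError("need at least 42 bytes of FLAC data")
    if data[:4] != b"fLaC":
        raise ValueError("missing 'fLaC' stream marker")

    # Metadata block header: 1 flag bit, 7-bit block type, 24-bit length.
    block_type = data[4] & 0x7F
    block_length = int.from_bytes(data[5:8], "big")
    if block_type != 0 or block_length < 34:
        raise ValueError("first metadata block is not a valid STREAMINFO")

    # STREAMINFO starts at byte 8. Its first 10 bytes hold block and frame
    # sizes; the next 8 bytes pack 20 bits of sample rate, 3 bits of
    # (channels - 1), 5 bits of (bits per sample - 1) and 36 bits of
    # total sample count, all big-endian.
    (packed,) = struct.unpack(">Q", data[18:26])
    return {
        "sample_rate": packed >> 44,
        "channels": ((packed >> 41) & 0x7) + 1,
        "bits_per_sample": ((packed >> 36) & 0x1F) + 1,
        "total_samples": packed & ((1 << 36) - 1),
    }

def matches_stated_format(info: dict) -> bool:
    """Return True if the header matches 16 kHz, 16-bit, mono."""
    return (
        info["sample_rate"] == 16000
        and info["channels"] == 1
        and info["bits_per_sample"] == 16
    )
```

To use it, read the first 42 bytes of any extracted FLAC file in binary mode and pass them to `read_flac_streaminfo`. Dividing `total_samples` by `sample_rate` gives the clip duration in seconds, provided the encoder recorded the sample count (a value of 0 means unknown).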

The data has undergone some quality checks, but mistranscriptions and audio artifacts may remain.