Crowd-sourced high-quality Tamil multi-speaker speech data set by Google

Full Official Name: Crowd-sourced high-quality Tamil multi-speaker speech data set by Google
Submission date: Sept. 26, 2019, 10:50 a.m.

This data set contains transcribed high-quality audio of Tamil sentences recorded by volunteers. The data set consists of wave files, and a TSV file (line_index.tsv). The file line_index.tsv contains a anonymized FileID and the transcription of audio in the file. The data set has been manually quality checked, but there might still be errors. License: Attribution-ShareAlike 4.0 International

Creator(s)
Distributor(s)
Right Holder(s)