Resource: Mandarin Chinese Desktop Speech Recognition Corpus - SMS (200 people)
|Reference||Mandarin Chinese Desktop Speech Recognition Corpus - SMS (200 people)|
|Date of Submission||Jan. 24, 2014, 4:30 p.m.|
|Resource Type||Primary Text|
This corpus comprises 7,276 entries spoken by 200 speakers (87 male, 113 female) of varied dialects, ages, and educational levels, recorded over 4 channels (Mic1: SHURE SM58; Mic2: ANC-700 head-mounted; Mic3: TELEX M-60; Mic4: ACOUSTIC MAGIC). The database contains 23,949 short messages (SMS) per channel. Speech samples are stored as 16-bit, 22.05 kHz WAV files, amounting to 35.6 hours of speech per channel. The total size of the data is 21.1 GB.
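As a rough sanity check on these figures, the storage footprint can be estimated from the sample format and recording duration. The sketch below assumes mono, uncompressed PCM and ignores WAV header overhead; none of these is stated in the record, and the function and constant names are illustrative only.

```python
# Back-of-the-envelope size estimate for the corpus figures quoted above.
# Assumptions (not stated in the record): mono audio, uncompressed PCM,
# WAV header overhead ignored.

SAMPLE_RATE_HZ = 22_050    # 22.05 kHz
BYTES_PER_SAMPLE = 2       # 16-bit samples
HOURS_PER_CHANNEL = 35.6   # hours of speech per channel
NUM_CHANNELS = 4           # Mic1..Mic4


def corpus_size_bytes(hours, channels,
                      rate_hz=SAMPLE_RATE_HZ,
                      bytes_per_sample=BYTES_PER_SAMPLE):
    """Estimated raw audio size in bytes for the given duration and channel count."""
    seconds = hours * 3600
    return seconds * rate_hz * bytes_per_sample * channels


per_channel = corpus_size_bytes(HOURS_PER_CHANNEL, 1)
total = corpus_size_bytes(HOURS_PER_CHANNEL, NUM_CHANNELS)

GIB = 2 ** 30  # binary gigabyte
print(f"per channel: {per_channel / GIB:.2f} GiB")
print(f"total:       {total / GIB:.2f} GiB")
```

Under these assumptions the estimate comes to roughly 21 binary gigabytes across the four channels, which is in line with the stated total of 21.1 if that figure is read as gigabytes rather than gigabits.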