Resource: German SpeechDat-Car
|Date of Submission||Jan. 24, 2014, 4:29 p.m.|
|Resource Type||Primary Text|
The German SpeechDat-Car database comprises 338 German speakers recorded over the mobile telephone network. This database is partitioned into 17 DVDs and 1 CD. The speech databases made within the SpeechDat-Car project were validated by SPEX, the Netherlands, to assess their compliance with the SpeechDat-Car format and content specifications.
The speech data files are in two formats. The signal data format for the in-car mobile platform recordings is 16 kHz, 16 bit, uncompressed unsigned integers in Intel format (lo-hi byte order); the channels are multiplexed in a single file, with the channel sequence being 0-1-2-3. The format of the fixed platform audio files is 8 kHz, 8 bit alaw encoding. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
Each speaker uttered the following items:
The following age distribution has been obtained: 187 speakers are between 16 and 30, 72 speakers are between 31 and 45, 70 speakers are between 46 and 60, and 9 speakers are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.