Resource: Finnish SpeechDat-Car
|Date of Submission||Jan. 24, 2014, 4:29 p.m.|
|Resource Type||Primary Text|
The Finnish SpeechDat-Car contains the recordings of 302 Finnish speakers from 3 major dialectal areas (with 13 sub-areas) (151 males, 151 females), recorded over the GSM telephone network, and in a car. This database is partitioned into 142 CDs (DVDs are also available).
The speech data files are in two formats. Four of the 5 microphones were recorded on the computer in the boot of the car. The speech data are stored as sequences of 16 kHz, 16 bit and uncompressed. The fifth microphone was connected to the cell phone, and was recorded on a remote machine, with compressed data stored as sequences of 8 bit A-law 8.kHz. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat-Car format and content specifications.
Each speaker uttered the following items:
* 2 voice activation keywords
The following age distribution has been obtained: 138 speakers are between 16 and 30, 89 speakers are between 31 and 45, and 75 speakers are between 46 and 60. No speaker are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.