Resource: Danish SpeechDat(II) FDB-4000
|Reference||Danish SpeechDat(II) FDB-4000|
|Date of Submission||Jan. 24, 2014, 4:22 p.m.|
|Resource Type||Primary Text|
The Danish SpeechDat(II) FDB-4000 comprises 4,000 Danish speakers (1,940 males, 2,060 females) recorded over the Danish fixed telephone network. The database is partitioned into 14 CDs: the first 13 CDs contain 300 speaker sessions each, and the 14th contains the remaining 100.
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specifications.
Speech samples are stored as 8-bit A-law values at a sampling rate of 8 kHz. Each prompted utterance is stored in a separate file. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
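As an illustration of the storage format described above, the following minimal sketch expands 8-bit A-law bytes to 16-bit linear PCM using the standard ITU-T G.711 expansion. It assumes the signal files are raw, headerless A-law bytes; that layout and the function names are assumptions made for this example, not part of the database documentation.

```python
def alaw_byte_to_pcm16(code: int) -> int:
    """Expand one 8-bit A-law code to a 16-bit linear PCM value (G.711)."""
    code ^= 0x55                      # undo the even-bit inversion applied by A-law
    magnitude = (code & 0x0F) << 4    # 4-bit mantissa
    segment = (code & 0x70) >> 4      # 3-bit segment (exponent)
    if segment == 0:
        magnitude += 8
    else:
        magnitude += 0x108
        magnitude <<= segment - 1
    # In A-law, a set sign bit (after the XOR) marks a positive sample.
    return magnitude if (code & 0x80) else -magnitude


def decode_alaw(data: bytes) -> list[int]:
    """Decode a whole buffer of A-law bytes to a list of PCM samples."""
    return [alaw_byte_to_pcm16(b) for b in data]


def duration_seconds(data: bytes, sample_rate: int = 8000) -> float:
    """One byte is one sample, so duration is byte count over sample rate."""
    return len(data) / sample_rate
```

Under the headerless assumption, a file's contents read with `open(path, "rb").read()` could be passed directly to `decode_alaw`, and one second of speech would occupy 8,000 bytes.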
Each speaker uttered the following items:
* 3 application words
The age distribution is as follows: 372 speakers are under 16, 1,004 speakers are between 16 and 30, 1,109 speakers are between 31 and 45, 901 speakers are between 46 and 60, and 614 speakers are over 60.
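The figures quoted in this description are mutually consistent, which a short check makes explicit. This sketch uses only the numbers stated above; the variable and function names are chosen for illustration.

```python
TOTAL_SPEAKERS = 4000

# Counts as stated in the description.
GENDER_COUNTS = {"male": 1940, "female": 2060}
AGE_COUNTS = {
    "under 16": 372,
    "16-30": 1004,
    "31-45": 1109,
    "46-60": 901,
    "over 60": 614,
}
# 13 CDs with 300 speaker sessions each, plus one CD with 100.
CD_SESSIONS = [300] * 13 + [100]


def check_totals() -> dict[str, bool]:
    """Report whether each breakdown adds up to the stated speaker total."""
    return {
        "gender": sum(GENDER_COUNTS.values()) == TOTAL_SPEAKERS,
        "age": sum(AGE_COUNTS.values()) == TOTAL_SPEAKERS,
        "cds": sum(CD_SESSIONS) == TOTAL_SPEAKERS,
    }
```

Calling `check_totals()` returns `True` for all three breakdowns: each one sums to 4,000.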
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.