Resource: Spanish SpeechDat(II) FDB-1000
|Reference||Spanish SpeechDat(II) FDB-1000|
|Date of Submission||Jan. 24, 2014, 4:31 p.m.|
|Resource Type||Primary Text|
The Castillian Spanish SpeechDat(II) FDB-1000 database contains the recordings of 1,000 Castillian Spanish speakers (481 males, 519 females) recorded over the Spanish fixed telephone network. The FDB-1000 database is partitioned into 4 CDs, which comprise 250 speakers sessions each.
Speech samples are stored as sequences of 8-bit 8 kHz A-law. Each prompted utterance is stored in a separate file. Each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specifications.
Each speaker uttered the following items:
* 3 application words
The following age distribution has been obtained: 19 speakers are under 16, 555 speakers are between 16 and 30, 198 speakers are between 31 and 45, 198 speakers are between 46 and 60, and 30 speakers are over 60.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
This database is a subset of the Spanish SpeechDat(II) FDB-4000 (ref. ELRA-S0102).