BAS PHATT 1.1.X (complete corpus)

Full Official Name: BAS PHATT 1.1.X (complete corpus)
Submission date: Jan. 24, 2014, 4:17 p.m.

The Ph@ttSessionz speech database, funded by the German Ministry of Science and Education (BMBF), contains recordings of 864 adolescent speakers of German (age range 12-20). The recordings were performed via the WWW in public schools (Gymnasium) in 41 locations in Germany. The speech material recorded is a superset of the German SpeechDat-II and RVG-I corpora (see also ELRA-S0051, S0058, S0063, S0096 and S0155). Recordings were done with SpeechRecorder in selected schools in the years 2005-2007. Both channels, the headset and the desktop microphone, were recorded in high quality.

The BAS PHATT corpus is available in two versions: BAS PHATT 1.0.X (sub-set, ELRA-S0282-01) and BAS PHATT 1.1.X (complete corpus, ELRA-S0282-02).

BAS PHATT 1.1.X contains:
- 138 items:
  - 12 single digits
  - 18 numbers
  - 12 commands
  - 30 phonetically rich sentences
  - 13 telephone numbers
  - 9 digit strings: 3 all digits, 3 credit card numbers, 3 PIN codes
  - 3 date expressions
  - 12 spelling items: 2 arbitrary sequences, 5 geographical names, 5 person names
  - 3 geographical names
  - 3 company names
  - 2 person names
  - 11 phonetics test sentences
  - 3 time expressions
  - 8 spontaneous texts (text production): 5 short texts, 3 long texts
- Total number of recordings: approx. 120,000
- Duration: ca. 12,500 minutes
- Formats: WAV 22.05 kHz, 16 bit, SpeechDat Transliteration, BAS Partitur Format (BPF)
- Segmentation: manual segmentation begin/end utterance, automatic phonemic segmentation with MAUS
- Distribution: 15 DVD-R ISO 9660

See also ELRA-S0082-01.
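
As an illustration of the audio format stated in this record (WAV, 22.05 kHz, 16 bit), the following minimal sketch checks one file against those two values using only the Python standard library. The function name check_wav_format and the returned dictionary keys are hypothetical and not part of the corpus distribution. The sketch makes no assumption about how the headset and desktop microphone channels are stored, so it reports the channel count without checking it.

```python
import wave

# Values taken from the "Formats" entry of this record.
EXPECTED_RATE_HZ = 22050        # 22.05 kHz
EXPECTED_SAMPLE_WIDTH = 2       # 16 bit = 2 bytes per sample


def check_wav_format(path):
    """Return (ok, details) for a single WAV file.

    ok is True when sample rate and sample width match the values
    stated in the record. The channel count is reported only,
    because the record does not say how channels are stored.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()
        channels = wav.getnchannels()
        frames = wav.getnframes()
    duration_s = frames / rate if rate else 0.0
    ok = rate == EXPECTED_RATE_HZ and width == EXPECTED_SAMPLE_WIDTH
    details = {
        "rate_hz": rate,
        "sample_width_bytes": width,
        "channels": channels,
        "duration_s": duration_s,
    }
    return ok, details
```

Calling check_wav_format on a path to any WAV file returns a flag plus the measured values, which makes mismatches easy to log when checking a copy of the data.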

Creator(s)
Distributor(s)
Right Holder(s)