Resource: BAS PHATT 1.1.X (complete corpus)

Reference BAS PHATT 1.1.X (complete corpus)
Date of Submission Jan. 24, 2014, 4:17 p.m.
Status accepted
ISLRN 847-046-185-654-8
Resource Type Primary Text
Media Type Audio
Source
Language German
Description

The Ph@ttSessionz speech database, funded by the German Ministry of Science and Education (BMBF), contains recordings of 864 adolescent speakers of German (age range 12-20). The recordings were made over the WWW in public schools (Gymnasium) at 41 locations in Germany. The recorded speech material is a superset of the German SpeechDat-II and RVG-I corpora (see also ELRA-S0051, S0058, S0063, S0096 and S0155). Recordings were made with the SpeechRecorder software in selected schools between 2005 and 2007. Both channels, the headset and the desktop microphone, were recorded in high quality.

The BAS PHATT corpus is available in two versions: BAS PHATT 1.0.X (sub-set, ELRA-S0282-01) and BAS PHATT 1.1.X (complete corpus, ELRA-S0282-02).

BAS PHATT 1.1.X contains:
- 138 items:
  - 12 single digits
  - 18 numbers
  - 12 commands
  - 30 phonetically rich sentences
  - 13 telephone numbers
  - 9 digit strings: 3 all digits, 3 credit card numbers, 3 PIN codes
  - 3 date expressions
  - 12 spelling items: 2 arbitrary sequences, 5 geographical names, 5 person names
  - 3 geographical names
  - 3 company names
  - 2 person names
  - 11 phonetics test sentences
  - 3 time expressions
  - 8 spontaneous texts (text production): 5 short texts, 3 long texts
- Total number of recordings: approx. 120,000
- Duration: approx. 12,500 minutes
- Formats: WAV 22.05 kHz, 16 bit, SpeechDat Transliteration, BAS Partitur Format (BPF)
- Segmentation: manual segmentation of utterance begin/end, automatic phonemic segmentation with MAUS
- Distribution: 15 DVD-R, ISO 9660
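
As a rough illustration of the signal format listed above (WAV, 22.05 kHz, 16 bit), the following Python sketch reads a WAV header with the standard-library wave module and checks it against those two values. The function names are hypothetical, and nothing is assumed about the corpus directory layout, file naming, or how the two microphone channels are stored; the channel count is only reported, not checked.

```python
import wave

# Values taken from the format line of this record.
EXPECTED_RATE_HZ = 22050      # 22.05 kHz
EXPECTED_SAMPLE_BYTES = 2     # 16 bit = 2 bytes per sample


def describe_wav(path):
    """Return basic header fields of a PCM WAV file (hypothetical helper)."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "duration_s": frames / rate,
        }


def matches_catalog_format(info):
    """True if sampling rate and sample width match the listed format."""
    return (
        info["sample_rate_hz"] == EXPECTED_RATE_HZ
        and info["sample_width_bytes"] == EXPECTED_SAMPLE_BYTES
    )
```

Calling describe_wav on a recording and passing the result to matches_catalog_format would flag files that were resampled or converted after delivery.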

See also ELRA-S0082-01.

Version 1.0
Distributor ELRA