Reference FASiL multimodal “fasil-mm” corpus
Date of Submission Jan. 24, 2014, 4:29 p.m.
Status accepted
ISLRN 377-306-017-564-7
Resource Type Primary Text
Media Type Audio, Video
Language English, Portuguese, Swedish

The corpus was collected in the context of the FASiL project (EU FP5 IST-2001-38685) as a Wizard-of-Oz experiment. Sound and interaction recordings are therefore available for both subject and wizard. A total of 90 subjects were recorded (30 per language: English, Portuguese and Swedish).
The corpus is formatted as .wav files (u-law) for audio, plain ASCII text (.txt) for transcriptions, and a TASX XML file (.xml) for annotations, which binds everything together.
The multimodal Wizard-of-Oz experiment concerns voice interaction with a Virtual Personal Assistant (VPA) for email, calendar and contacts tasks. Hesitations are marked as “UH”, noise as “NOISE” and other irrelevant material as “IRRELEVANT”. All annotations are in lower case, except for these three markers.
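The marker convention above can be illustrated with a minimal Python sketch. It assumes that a transcription line is whitespace-separated and that the three markers appear as standalone tokens; neither detail is confirmed by this record. The function names and the sample utterance are hypothetical and not taken from the corpus.

```python
# Upper-case markers named in the corpus description.
MARKERS = {"UH", "NOISE", "IRRELEVANT"}


def split_tokens(line):
    """Separate lexical words from annotation markers.

    Assumption: tokens are whitespace-separated and markers are
    standalone tokens (not confirmed by the corpus documentation).
    """
    words, markers = [], []
    for token in line.split():
        if token in MARKERS:
            markers.append(token)
        else:
            words.append(token)
    return words, markers


def case_violations(line):
    """Return tokens that are neither a marker nor entirely lower case."""
    return [t for t in line.split() if t not in MARKERS and t != t.lower()]


# Hypothetical utterance, invented for illustration only.
sample = "UH send an email NOISE to john"
words, markers = split_tokens(sample)
```

A check such as `case_violations` could help flag transcription lines that depart from the lower-case rule before further processing.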
The experiment is documented in detail in FASiL deliverable D.2.2_b.
See also S0174-01, S0174-02, S0174-03, and S0174-04.

Version 1.0
Distributor ELRA