Resource: SKM

Reference SmartKom Mobil
Date of Submission Jan. 24, 2014, 4:31 p.m.
Status accepted
ISLRN 059-238-901-611-7
Resource Type Primary Text
Media Type Audio, Video
Source
Language German
Description

The SmartKom corpora were produced at BAS in the years 1999 to 2003 within the SmartKom project which was funded by the German Ministry of Education and Science. The corpus consists of multi-modal recordings (“sessions”) of 224 persons in a Wizard-of-Oz setting.
Release SKM 1.0 contains 146 recordings in the technical setup (“scenario”) SmartKom Mobil which is a portable PDA equipped with a net link and additional intelligent communication devices. Naive users were asked to test a “prototype” for a market study not knowing that the system was in fact controlled by two human operators. They were asked to solve two tasks in a period of 4,5 min while they were left alone with the system. The instruction was kept to a minimum; in fact the user only knew that the system is able to understand speech, gestures and should more or less communicate like a human.
Experiments were not performed in the field but rather in a studio-like environment. Background noise was played back artificially and the users did not carry the PDA in their hand but rather used a much smaller version of the SIVIT projection plane (to simulate a PDA display) and a pen as a pointing device. Speakers were speaking to a headset microphone.
Main technical features of release SKM 1.0
• Technical setup: Mobil (scenario)
• Primary domain “Tourism”; secondary domain “Telephony”
• Primary domain “Navigation”; secondary domain “Looking for parking place”
• 73 users
• 146 recording sessions; size: 490 GB
• Recorded modalities:
o Audio in max 9 channels
o Video of face
o Video of upper body from the left
o Infrared video of the display area (to capture the 2D gestures) as input to the SIVIT device (Siemens gesture recognizer)
o Video of the GUI output
o Coordinates of graphic tableau (when pen was used)
o Coordinates of SIVIT device (when finger/hands were used)
• Annotations:
o Transliteration
o 2D Gesture
o user states in three modalities
o Turn segmentation
• Documentation, TechDoks and publications
• All annotations compatible to the “BAS Partitur Format” (BPF)

The full database is provided on USB. Single volumes on DVD can be obtained upon demand.

Version 1.0
Distributor ELRA