Resource: PolyVar

Reference PolyVar
Date of Submission Jan. 24, 2014, 4:31 p.m.
Status accepted
ISLRN 481-023-813-564-2
Resource Type Primary Text
Media Type Audio
Source
Language French
Description

PolyVar is a speaker verification database comprising native and non-native speakers of French, mainly from Switzerland but also from other European countries. It consists of read and spontaneous speech recorded by 143 speakers (85 male and 58 female) amounting to 160 hours of speech. Each speaker recorded from 1 to 229 sessions, giving a total of 3,600 recorded sessions. The data are provided with orthographic annotation.
The number of calls per speaker is as follows:

13 speakers called more than 100 times
9 speakers called from 51 to 100 times
16 speakers called from 21 to 50 times
3 speakers called from 11 to 20 times
31 speakers called from 2 to 10 times
71 speakers called only once
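
As a quick consistency check, the bucket counts above can be summed and compared with the 143 speakers (85 male and 58 female) given in the description. The following is only an illustrative sketch; the variable names are hypothetical and the figures are copied from this record.

```python
# Speaker counts per call-frequency bucket, copied from the list above.
# The labels are informal; only the counts matter for the check.
call_buckets = [
    ("top bucket", 13),
    ("51 to 100 calls", 9),
    ("21 to 50 calls", 16),
    ("11 to 20 calls", 3),
    ("2 to 10 calls", 31),
    ("1 call", 71),
]

total_speakers = sum(count for _, count in call_buckets)

# Speaker totals stated in the description.
male_speakers, female_speakers = 85, 58

# Fraction of speakers who called only once.
single_call_share = call_buckets[-1][1] / total_speakers

print(total_speakers, male_speakers + female_speakers)
print(f"{single_call_share:.1%} of speakers called only once")
```

Both totals come to 143, so the distribution accounts for every speaker, and roughly half of them contributed a single session.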

Each speaker uttered up to 53 different items per session, including:

* 3 sequences of digits (1 ID number, 1 credit card number and 1 sequence of 6 digits)
* 24 application words (17 words about tourism in Martigny)
* 10 read sentences
* 4 numbers (2 natural numbers, 2 amounts)
* 2 items with dates (1 read/1 spontaneous)
* 2 items with hours (1 read/1 spontaneous)
* 2 spelled words
* 3 spontaneous answers (questions about their gender, native language and the weather)
* 1 comment
* 1 telephone enquiry

File format: 8-bit A-law
Standard in use: NIST
Sampling rate: 8 kHz
Medium: 8 CD-ROMs
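
For readers who need to work with the audio, 8-bit A-law samples are usually expanded to 16-bit linear PCM before processing. The sketch below follows the standard G.711 A-law expansion. It is not part of the corpus distribution: the function names are hypothetical, and it assumes the input is a raw A-law payload with any file header already removed (the header layout is not described in this record).

```python
SAMPLE_RATE_HZ = 8000  # sampling rate stated in this record


def alaw_to_pcm16(byte: int) -> int:
    """Expand one 8-bit A-law code to a 16-bit linear PCM value (G.711)."""
    byte ^= 0x55                      # undo the even-bit inversion
    mantissa = (byte & 0x0F) << 4     # 4-bit quantisation step
    segment = (byte & 0x70) >> 4      # 3-bit segment (chord) number
    if segment == 0:
        magnitude = mantissa + 8
    else:
        magnitude = (mantissa + 0x108) << (segment - 1)
    # After the XOR, a set top bit marks a positive sample.
    return magnitude if byte & 0x80 else -magnitude


def decode_alaw(payload: bytes) -> list[int]:
    """Decode a raw A-law payload (header assumed stripped) to PCM samples."""
    return [alaw_to_pcm16(b) for b in payload]


def duration_seconds(num_samples: int) -> float:
    """Duration of a mono recording at the 8 kHz rate given above."""
    return num_samples / SAMPLE_RATE_HZ


if __name__ == "__main__":
    samples = decode_alaw(bytes([0xD5, 0x55, 0xAA, 0x2A]))
    print(samples)
    print(duration_seconds(8000))
```

At one byte per sample and 8 kHz, each second of speech occupies 8,000 bytes of payload, which is how `duration_seconds` can be applied directly to a payload length.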

See also ELRA-S0047.

Version 1.0
Distributor ELRA