Resource: UCLA Speaker Variability Database

Reference UCLA Speaker Variability Database
Date of Submission Oct. 15, 2021, 6:22 p.m.
Status accepted
ISLRN 890-519-200-259-7
Resource Type Primary Text
Media Type Text, Audio
Language English
Format/MIME Type text/plain, audio/x-flac
Size 1711157 KB
Access Medium Web Download


UCLA Speaker Variability Database was developed by UCLA Speech Processing and Auditory Perception Laboratory and is comprised of approximately 34 hours of English speech and orthographic transcripts.

This corpus was designed to sample variability in speaking within individual speakers and across a large number of speakers. Participants took part in six different tasks: vowel sounds, reading sentences, giving instructions, neutral conversation, happy conversation, a phone conversation, annoyed conversation, and responding to a video.


Speakers (101 female, 101 male) were recruited from the UCLA university community and represented a variety of language backgrounds. Speaker metadata is included in the documentation for this release.

Audio files are presented as 16kHz, 16-bit single channel, flac compressed wav files.

Most speech files have two corresponding Praat TextGrid files: an orthographic sentence/utterance transcription and output from forced alignment.

Version 1.0
Creator Jody Kreiman , Abeer Alwan , Patricia Keating , Adam Chong , Yoonjeong Lee
Distributor Linguistic Data Consortium
Rights Holder Portions © 2021 Abeer Alwan, © 2021 Trustees of the University of Pennsylvania