Resource: Italian TTS Speech Corpus (Appen)

Reference Italian TTS Speech Corpus (Appen)
Date of Submission Jan. 24, 2014, 4:29 p.m.
Status accepted
ISLRN 976-246-706-503-6
Resource Type Primary Text
Media Type Audio
Source
Language Italian
Description

The Italian TTS Speech Corpus contains the recordings of 1 native Italian speaker (male, 50 years old) recorded in a studio over 1 channel (Shure SM15 unidirectional professional head-word condenser microphone). The data collection and transcription were performed by Appen (Australia).
Speech samples are stored as sequences of 16-bit 22.05 kHz PCM in uncompressed WAV files.
The speaker read 3,300 prompted sentences covering all legal triphones and diphones.
The database is provided with orthographic transcriptions in SAMPA, including canonical and alternative pronunciation, and syllable, stress and acoustic events markings. All transcriptions were segmented at the utterance (sentence/command word) level, annotated at the word level and checked manually. A pronunciation lexicon including 7,300 headwords (plus variants) is also available.
This database is aimed to be used within text-to-speech and speech synthesis applications.

Version 1.0
Distributor ELRA