ISLRN

Wuhan Dialect Speech Data by Mobile Phone - 997 Hours

Full Official Name: Wuhan Dialect Speech Data by Mobile Phone - 997 Hours

Submission date: Oct. 7, 2022, 4:43 p.m.

Mobile phone captured audio data of Wuhan dialect, 997 hours in total, recorded by more than 2,000 Wuhan dialect native speakers. The recorded text covers generic, interactive, on-board, home and other categories, with rich contents. Wuhan locals participate in quality check and proofreading. Sentence accuracy rate reaches 95 %; this data set can be used for automatic speech recognition, machine translation, and voiceprint recognition. Format：16kHz, 16bit, uncompressed wav, mono channel Recording environments：quiet indoor environment, without echo Recording content (read speech)：generic category; human-machine interaction category; smart home command and control category; numbers; dialect Demographics：2,291 people, 55% of which are female. Transcription content：text, noisy symbols, special identifiers Device：Android mobile phone, iPhone Language：Wuhan dialect Accuracy rate：95% (the accuracy rate of noise symbols and other identifiers is not included) Application scenarios：speech recognition, voiceprint recognition

Creator(s)

Distributor(s)

ELRA

Right Holder(s)

Status : Accepted

ISLRN :

822-900-971-360-0

Version

1.0

Source

http://catalog.elra.info/en-us/repository/browse/ELRA-S0455

Resource Type

Primary Text

Media Type

Audio

Language(s)

Chinese

Access Medium