Wuhan Dialect Speech Data by Mobile Phone - 997 Hours

Full Official Name: Wuhan Dialect Speech Data by Mobile Phone - 997 Hours
Submission date: Oct. 7, 2022, 4:43 p.m.

Mobile phone captured audio data of Wuhan dialect, 997 hours in total, recorded by more than 2,000 Wuhan dialect native speakers. The recorded text covers generic, interactive, on-board, home and other categories, with rich contents. Wuhan locals participate in quality check and proofreading. Sentence accuracy rate reaches 95 %; this data set can be used for automatic speech recognition, machine translation, and voiceprint recognition. Format:16kHz, 16bit, uncompressed wav, mono channel Recording environments:quiet indoor environment, without echo Recording content (read speech):generic category; human-machine interaction category; smart home command and control category; numbers; dialect Demographics:2,291 people, 55% of which are female. Transcription content:text, noisy symbols, special identifiers Device:Android mobile phone, iPhone Language:Wuhan dialect Accuracy rate:95% (the accuracy rate of noise symbols and other identifiers is not included) Application scenarios:speech recognition, voiceprint recognition

Right Holder(s)