Resource: Original Short-Message Data Collation II in Chinese (PinYin)

Reference Original Short-Message Data Collation II in Chinese (PinYin)
Date of Submission Jan. 24, 2014, 4:30 p.m.
Status accepted
ISLRN 745-287-055-486-8
Resource Type Primary Text
Media Type Text
Source
Language Chinese
Description

This corpus comprises 2,604,901 characters, corresponding to 202,277 daily-life short messages (SMS). This subset contains the original messages together with their PinYin transcription.
All data, including the PinYin transcription, have been manually proofread.

Version 1.0
Distributor ELRA