TRAD Chinese-English Email Parallel corpus - Development Set

Full Official Name: TRAD Chinese-English Email Parallel corpus - Development Set
Submission date: Nov. 21, 2016, 5:09 p.m.

This is a parallel corpus of 15,000 characters in Chinese (equivalent to 10,000 words) and a reference translation in English. The source texts are a selection of emails from the Speechocean King-NLP-001 corpus, a corpus of private emails collected from the daily life and business domains. The content has also been translated into French (see ELRA-W0114). This corpus was produced by ELDA within the PEA TRAD project supported by the French Ministry of Defence (DGA). It was used as a development set for MT systems.

Creator(s)
Distributor(s)
Right Holder(s)