Resource: TRAD Chinese-English Web domain (blogs) Parallel corpus

Reference TRAD Chinese-English Web domain (blogs) Parallel corpus
Date of Submission Oct. 17, 2016, 10:56 a.m.
Status accepted
ISLRN 982-341-079-331-4
Resource Type Primary Text
Media Type Text
Source
Language Chinese, English
Description

This is a parallel corpus of 15,000 characters in Chinese (equivalent to 10,000 words) and 2 reference translations in English. The source texts are blog articles dealing with various subjects such as economy, environment, society, technologies, etc. Articles are dated from June 2013. The translation has been conducted by two different translation teams following a strict protocol aimed at producing high quality translations.

The content has also been translated into French (see ELRA-W0109).

This corpus was produced by ELDA within the PEA TRAD project supported by the French Ministry of Defence (DGA). It was used as a test set for an internal MT evaluation campaign.

Version 1.0
Distributor ELRA