Resource: TRAD Arabic-English Mailing lists Parallel corpus - Development set

Reference TRAD Arabic-English Mailing lists Parallel corpus - Development set
Date of Submission Oct. 17, 2016, 10:56 a.m.
Status accepted
ISLRN 213-044-240-074-6
Resource Type Primary Text
Media Type Text
Source
Language Arabic, English
Description

This is a parallel corpus of 10,000 words in Arabic and a reference translation in English. The source texts are emails collected from Wikiar-I, a mailing list for discussions about the Arabic Wikipedia. The collected emails are dated from 2004 to 2007.

The translation has been conducted following a strict protocol aimed at producing high quality translations.

The content is also translated into French (see ELRA-W0107).

This corpus was produced by ELDA within the PEA TRAD project supported by the French Ministry of Defence (DGA). It was used as a development set for an internal MT evaluation campaign.

Version 1.0
Distributor ELRA