Resource: Romanian - English literature corpus (Processed)

Reference Romanian - English literature corpus (Processed)
Date of Submission March 9, 2020, 12:27 p.m.
Status accepted
ISLRN 050-476-818-226-7
Resource Type Primary Text
Media Type Text
Source
Language English, Romanian
Format/MIME Type application/x-tmx+xml
Size 5,280 translation units, 176,179 words
Access Medium downloadable
Description

This dataset was created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project, see http://lr-coordination.eu.
A bilingual Romanian-English literary corpus built from a small set of freely available books (drama, science fiction, etc.). The texts are positionally aligned: the sentence on line i of the English text corresponds to the sentence on line i of the Romanian text. The alignment was manually validated.
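
Since the record lists the format as application/x-tmx+xml, the aligned sentence pairs can presumably be read as TMX translation units. The following is a minimal sketch using only the Python standard library, not a description of how the corpus was built. It assumes that each translation unit holds one English and one Romanian variant and that they are tagged with the language codes "en" and "ro". The function name read_tmx_pairs and the inline sample are hypothetical and are not taken from the distributed file.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="en", tgt_lang="ro"):
    """Return (source, target) segment pairs from a TMX path or binary file object.

    The language codes are assumptions; check the codes used in the actual file.
    """
    pairs = []
    root = ET.parse(source).getroot()
    for tu in root.iter("tu"):
        segments = {}
        for tuv in tu.findall("tuv"):
            # TMX 1.4 uses xml:lang; older files may use a plain lang attribute.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() also collects text nested inside inline markup.
                text = "".join(seg.itertext()).strip()
                # Reduce regional codes such as "en-GB" to the base language.
                segments[lang.split("-")[0]] = text
        # Keep only units that contain both languages.
        if src_lang in segments and tgt_lang in segments:
            pairs.append((segments[src_lang], segments[tgt_lang]))
    return pairs


# Hypothetical two-unit sample, used only to illustrate the TMX layout.
SAMPLE = """<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header creationtool="example" creationtoolversion="1" segtype="sentence"
          o-tmf="none" adminlang="en" srclang="en" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Good morning.</seg></tuv>
      <tuv xml:lang="ro"><seg>Bună dimineața.</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="en"><seg>Thank you.</seg></tuv>
      <tuv xml:lang="ro"><seg>Mulțumesc.</seg></tuv>
    </tu>
  </body>
</tmx>"""

if __name__ == "__main__":
    for english, romanian in read_tmx_pairs(io.BytesIO(SAMPLE.encode("utf-8"))):
        print(english, "|", romanian)
```

Passing the path of the downloaded TMX file in place of the in-memory sample should, under the same assumptions, yield one pair per translation unit, so the length of the returned list can be compared with the 5280 units stated under Size. Writing the first and second element of each pair to two separate files, one per line, would reproduce the line-by-line positional alignment described above.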

Version 2.0
Distributor ELRA
Rights Holder Dan Tufis