Public Procurement Dataset 1 (Processed)

Full Official Name: Public Procurement Dataset 1 (Processed)
Submission date: March 2, 2020, 11:46 a.m.

This dataset was created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project, see http://lr-coordination.eu. It is a collection of parallel Polish-English texts published by the Polish Public Procurement Office. Translation segments were aligned manually at sentence level and encoded in the XLIFF format. The collection contains two publications: a) Report on functioning of public procurement system in 2009 (raport_uzp_2009.xlf, 1495 segments, 65237 words) and b) Report on functioning of public procurement system in 2010 (raport_uzp_2010.xlf, 1188 segments, 58684 words). The total size of the collection is 123921 words in 2683 parallel segments. It was converted into an English-Polish resource of 1578 translation units (TUs) in TMX format.
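
For readers who want to load the TMX resource, the following is a minimal sketch of how the translation units could be read with Python's standard library. It is not part of the dataset distribution. The function name read_tmx_pairs, the language codes "en" and "pl", and the sample document are assumptions made for illustration; the actual file may use different language tags (for example regional variants), which the sketch reduces to their primary subtag.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_pairs(source, src_lang="en", tgt_lang="pl"):
    """Return (source, target) segment pairs from a TMX path or binary file object.

    Hypothetical helper: the language codes are assumptions, not dataset facts.
    """
    pairs = []
    root = ET.parse(source).getroot()
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            # Older TMX versions use a plain "lang" attribute instead of xml:lang.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() also collects text nested inside inline markup.
                segs[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        # Keep only units that carry both languages.
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


# Tiny made-up TMX document for illustration (not taken from the dataset).
SAMPLE_TMX = """<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header creationtool="example" creationtoolversion="1" segtype="sentence"
          o-tmf="none" adminlang="en" srclang="en" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Public procurement report</seg></tuv>
      <tuv xml:lang="pl"><seg>Sprawozdanie o zamówieniach publicznych</seg></tuv>
    </tu>
  </body>
</tmx>
"""

pairs = read_tmx_pairs(io.BytesIO(SAMPLE_TMX.encode("utf-8")))
```

To use the sketch on the real resource, pass the path of the downloaded TMX file instead of the in-memory sample; the number of returned pairs can then be compared with the 1578 TUs stated above.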

Creator(s)
Distributor(s)
Right Holder(s)