Public Procurement Dataset 2 (Processed)

Full Official Name: Public Procurement Dataset 2 (Processed)
Submission date: March 2, 2020, 11:46 a.m.

This dataset was created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project, see http://lr-coordination.eu. The dataset is a collection of parallel Polish-English texts published by the Polish Public Procurement Office. Sentence-level alignment of translation segments was carried out manually and encoded in the XLIFF format. The collection contains two publications: a) Report on functioning of public procurement system in 2006 (raport_uzp_2006.xlf, 705 segments, 31141 words) and b) Report on functioning of public procurement system in 2011 (raport_uzp_2010.xlf, 1434 segments, 73456 words). The total size of the collection is 104597 words in 2139 parallel segments. It was converted into an English-Polish resource of 1842 translation units (TUs) in TMX format.
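For readers who want to load the TMX resource, the following is a minimal sketch using only the Python standard library. The function name read_tmx_pairs, the language codes "en" and "pl", and the sample document are assumptions made for illustration; they are not taken from the distributed file, whose header and language tags may differ.

```python
import io
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Invented one-unit sample for illustration only; not taken from the dataset.
SAMPLE_TMX = """<tmx version="1.4">
  <header/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Public procurement</seg></tuv>
      <tuv xml:lang="pl"><seg>Zamówienia publiczne</seg></tuv>
    </tu>
  </body>
</tmx>"""


def read_tmx_pairs(source, src_lang="en", tgt_lang="pl"):
    """Return (source, target) text pairs from a TMX file path or file object.

    Language codes are compared on their primary subtag, lowercased,
    so "EN-GB" matches "en". This is an assumption about the file's tags.
    """
    root = ET.parse(source).getroot()
    pairs = []
    for tu in root.iter("tu"):
        segs = {}
        for tuv in tu.findall("tuv"):
            # Older TMX versions use a plain "lang" attribute instead of xml:lang.
            lang = (tuv.get(XML_LANG) or tuv.get("lang") or "").lower()
            seg = tuv.find("seg")
            if seg is not None:
                # itertext() also collects text nested inside inline markup.
                segs[lang.split("-")[0]] = "".join(seg.itertext()).strip()
        # Keep only units that carry both languages.
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


if __name__ == "__main__":
    demo = read_tmx_pairs(io.BytesIO(SAMPLE_TMX.encode("utf-8")))
    print(len(demo), "translation unit(s) read")
```

Applied to the actual TMX file, the length of the returned list could be compared with the 1842 translation units stated above, provided the file's language tags match the assumed codes.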

Creator(s)
Distributor(s)
Right Holder(s)