Resource: Karl May Korpus (KMK)
|Reference||Karl May Korpus (KMK)|
|Date of Submission||Jan. 24, 2014, 4:30 p.m.|
|Resource Type||Primary Text|
The "Karl-May-Korpus" is a monolingual German corpus, available in an SGML-tagged ASCII text format. It contains the works of the German author Karl May (1842-1912) and consists of around 1.6 million words (divided into 9 subcorpora of about 180,000 words each). The corpus was created between 1993 and 1997.
Each word form is tagged with a word class (one of 43 classes) and its lemma.
File format: Text
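As a rough illustration of how the per-token annotation described above (word form, word class, lemma) might be read, here is a minimal Python sketch. The markup it parses is an assumption: this record does not document the corpus's actual SGML element or attribute names, so the `<w pos="..." lemma="...">` layout, the function `parse_tokens`, and the tag labels in the sample are hypothetical placeholders and not the real 43-class inventory.

```python
import re
from collections import Counter

# ASSUMPTION: hypothetical token markup. The real KMK element and
# attribute names are not given in this record and may differ.
TOKEN_RE = re.compile(r'<w\s+pos="([^"]+)"\s+lemma="([^"]+)">([^<]+)</w>')


def parse_tokens(sgml_text):
    """Return a list of (word_form, word_class, lemma) triples."""
    return [(form, pos, lemma) for pos, lemma, form in TOKEN_RE.findall(sgml_text)]


# Invented sample line; the tag labels are illustrative placeholders only.
sample = (
    '<w pos="ART" lemma="der">Der</w> '
    '<w pos="NN" lemma="Mann">Mann</w> '
    '<w pos="VVFIN" lemma="reiten">ritt</w>'
)

tokens = parse_tokens(sample)

# Count how often each word class occurs in the sample.
class_counts = Counter(pos for _, pos, _ in tokens)

for form, pos, lemma in tokens:
    print(f"{form}\t{pos}\t{lemma}")
```

A regular expression is enough for this flat, one-element-per-token layout; if the actual files nest elements or use SGML features such as omitted end tags, a proper SGML parser would be the safer choice.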