Resource: Karl May Korpus (KMK)

Reference Karl May Korpus (KMK)
Date of Submission Jan. 24, 2014, 4:30 p.m.
Status accepted
ISLRN 628-817-117-400-1
Resource Type Primary Text
Media Type Text
Language German

The "Karl-May-Korpus" is a monolingual German corpus distributed as SGML-tagged ASCII text. It contains the works of the German author Karl May (1842-1912) and consists of around 1.6 million words, divided into 9 subcorpora of about 180,000 words each. The corpus was created between 1993 and 1997.

Each word form is tagged with its word class (one of 43 classes) and its lemma.
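
A minimal sketch of how such per-token annotation might be read, in Python. This record does not document the actual markup, so the element name `w`, the attribute names `pos` and `lemma`, and the class labels in the sample are all hypothetical placeholders, not the corpus's real tagset or SGML layout.

```python
import re
from collections import Counter
from typing import List, NamedTuple


class Token(NamedTuple):
    form: str   # word form as it appears in the text
    pos: str    # word class label (hypothetical label set)
    lemma: str  # lemma assigned to the word form


# Hypothetical markup, assumed for illustration only:
#   <w pos="NOUN" lemma="Haus">Hauses</w>
TOKEN_RE = re.compile(r'<w\s+pos="([^"]*)"\s+lemma="([^"]*)">([^<]*)</w>')


def parse_tokens(line: str) -> List[Token]:
    """Extract (form, word class, lemma) triples from one tagged line."""
    return [
        Token(form=m.group(3), pos=m.group(1), lemma=m.group(2))
        for m in TOKEN_RE.finditer(line)
    ]


def class_counts(tokens: List[Token]) -> Counter:
    """Count how often each word class label occurs."""
    return Counter(t.pos for t in tokens)


# Invented sample line, not taken from the corpus.
sample = (
    '<w pos="DET" lemma="der">des</w> '
    '<w pos="NOUN" lemma="Haus">Hauses</w>'
)
tokens = parse_tokens(sample)
counts = class_counts(tokens)
```

If the real files use a different element or attribute layout, only the regular expression needs to change; the `Token` triple mirrors the three pieces of information the description names (word form, word class, lemma).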

File format: Text
Standard in use: SGML
Character set: 8-bit ASCII

Version 1.0
Distributor ELRA