
Full Official Name: Litkey Corpus
Submission date: July 18, 2019, 4:26 p.m.

The Litkey Corpus is a richly annotated longitudinal corpus of written texts produced by primary school children in Germany from grades 2 to 4. It has been transcribed and annotated at several linguistic levels, including POS tags, features of the word-internal structure (phonemes, syllables, morphemes), and key orthographic features of the target words, as well as a categorization of spelling errors. Comprehensive evaluations show that high accuracy was achieved on all levels, making the Litkey Corpus a useful resource for corpus-based research on literacy acquisition among German primary school children and for developing NLP tools for educational purposes. The corpus is freely available under

Right Holder(s)