Bitext Lexical Dataset - Language Variants - Arabic

Full Official Name: Bitext Lexical Dataset - Language Variants - Arabic
Submission date: July 17, 2023, 5:25 p.m.

As a complement to the generic vocabulary provided in ELRA-L0136, language variants of Arabic are provided with the following features: Voice, Tense, Mood, Person, Number, Gender, Case, Definiteness, Pronominal Clitics, Category (except for Arabic MSA). Variants are distributed as follows: - Arabic Gulf: 20,000 lemmas / 9,000,000 forms - Arabic Najdi: 20,000 lemmas / 1,000,000 forms - Arabic Egypt: 20,000 lemmas / 1,000,000 forms - Arabic MSA: 22,000 lemmas / 17,800,000 forms

Creator(s)
Distributor(s)
Right Holder(s)