Resource: French lexicon with morphological information

Reference French lexicon with morphological information
Date of Submission Jan. 24, 2014, 4:29 p.m.
Status accepted
ISLRN 060-941-184-122-3
Resource Type Lexicon
Media Type Text
Source
Language French
Size 17 mb
Description

This French lexicon is made up of 424,000 inflected forms corresponding to 55,000 simple word lemmas. It contains:
- 34,400 nouns, with gender, number and inflected forms (including irregular forms)
- 7,300 verbs, with mood, tense, person, gender, number and inflected forms (including irregular forms),
- 11,700 adjectives, with gender, number and inflected forms (including irregular forms),
- 1,400 adverbs,
- 200 pronouns, articles, prepositions/postpositions and conjunctions.

Each line in the resource file shows an inflected form, its part of speech, its related lemma and its morphological information. The inflected forms were generated using two databases: one containing the lemmas with the related root(s) and paradigm number(s), the other one containing the paradigm numbers with the related terminations and morphological information.

Each row in the resource file consists of four fields following the structure below:
Lemma|part of speech|inflected form|morphological information

The part of speech and the morphological information are encoded using our internal standard (an abbreviation key file is also provided).

Version 1.0
Distributor ELRA