Resource: GLiCom Spanish Wordform list - Regular word-forms

Reference GLiCom Spanish Wordform list - Regular word-forms
Date of Submission Nov. 2, 2015, 6:24 p.m.
Status accepted
ISLRN 282-492-887-361-0
Resource Type Lexicon
Media Type Text
Source
Language Spanish, Castilian
Access Medium CD-ROM
Description

GLiCom Spanish Wordform List v.1 is a computational lexicon of inflected wordforms in Spanish. Each entry has the following information: (i) lemma, (ii) morphosyntactic tag, and (iii) word type. This lexicon can be used in any application for Text Analysis in Spanish, in particular those in need for a lemmatizer, POS tagger, or Named Entity recogniser.

This set consists of a subdivision of the full lexicon and contains the list of word forms only. For the full lexicon, see ELRA-L0095-01.

The list of wordforms contains 1,152,242 entries, including (i) regular words (1,144,086), (ii) toponyms and anthroponyms (8,032), (iii) abbreviations and acronyms (775), and (iv) computational terms (124). Each entry consists of: form, lemma, morphosyntactic tag and the word type.

Version 1.0
Distributor ELRA