The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Portuguese consists of 40,000 lemmas (3,500,000 forms) as well as the following extra features: Tense, Mood, Person, Number, Gender and Pronominal Clitics.