Resource: Venice Italian Treebank (VIT)
|Reference||Venice Italian Treebank (VIT)|
|Date of Submission||Jan. 24, 2014, 4:32 p.m.|
|Resource Type||Primary Text|
|Format/MIME Type||Plain text|
The VIT, Venice Italian Treebank is the effort of the collaboration of people working at the Laboratory of Computational Linguistics of the University of Venice in the years 1995-2005. It is partly the result of annotation carried out internally with no specific project in mind and no financial support. This work was partly related to the development of a lexicon, a morphological analyzer, a tagger, a deep parser of Italian. All these resources were finally ready at the beginning of the ‘90s when the LCL got involved in the first national projects.
The VIT contains about 272,000 words distributed over six different domains, and this is what makes it so relevant for the study of the structure of Italian language. The following domains were annotated:
Domain Number of words Time span
In addition, some 60,000 tokens of spoken dialogues in different Italian varieties were annotated.