Resource: Monolingual Vietnamese Annotated Corpus

Reference Monolingual Vietnamese Annotated Corpus
Date of Submission June 2, 2021, 12:15 p.m.
Status accepted
ISLRN 004-081-406-421-7
Resource Type Primary Text
Media Type Text
Source
Language Vietnamese
Format/MIME Type text/xml
Size 100000 sentences
Description

The Monolingual Vietnamese Annotated Corpus consists of 100,000 sentences, manually annotated with word boundaries, POS, named entities, with an average length of 20 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.

Version 1.0
Distributor ELRA