A Chinese Reading Comprehension Dataset for the 1st Chinese Machine Reading Comprehension Evaluation

Full Official Name: A Chinese Reading Comprehension Dataset for the 1st Chinese Machine Reading Comprehension Evaluation
Submission date: Sept. 13, 2017, 11:13 a.m.

Machine Reading Comprehension (MRC) has become enormously popular recently and has attracted a lot of attention. However, existing reading comprehension datasets are mostly in English. To add diversity to reading comprehension datasets, in this paper we propose a new Chinese reading comprehension dataset to accelerate related research in the community. The proposed dataset contains two different types: cloze-style reading comprehension and user-query-style reading comprehension, both associated with large-scale training data as well as human-generated validation and hidden test sets. Along with this dataset, we also hosted the first Evaluation of Chinese Machine Reading Comprehension (CMRC-2017), which successfully attracted tens of participants, suggesting the potential impact of this dataset.
