Resource: A Chinese Reading Comprehension Dataset for the 1st Chinese Machine Reading Comprehension Evaluation

Reference A Chinese Reading Comprehension Dataset for the 1st Chinese Machine Reading Comprehension Evaluation
Date of Submission Sept. 13, 2017, 11:13 a.m.
Status accepted
ISLRN 451-824-550-408-2
Resource Type Primary Text
Media Type Text
Source
Language Chinese
Format/MIME Type text
Access Medium Web Download
Description

Machine Reading Comprehension (MRC) has recently become enormously popular and has attracted a lot of attention. However, existing reading comprehension datasets are mostly in English. To add diversity to reading comprehension datasets, in this paper we propose a new Chinese reading comprehension dataset to accelerate related research in the community. The proposed dataset contains two different types: cloze-style reading comprehension and user-query-style reading comprehension, both associated with large-scale training data as well as human-generated validation and hidden test sets. Along with this dataset, we also hosted the first Evaluation of Chinese Machine Reading Comprehension (CMRC-2017) and successfully attracted tens of participants, which suggests the potential impact of this dataset.

Version 1.1
Creator Wentao Ma, Guoping Hu, Shijin Wang, Zhipeng Chen, Ting Liu, Yiming Cui - iFLYTEK Research
Distributor Yiming Cui - iFLYTEK Research
Rights Holder Yiming Cui - iFLYTEK Research