ISLRN

A Chinese Reading Comprehension Dataset for the 1st Chinese Machine Reading Comprehension Evaluation

Full Official Name: A Chinese Reading Comprehension Dataset for the 1st Chinese Machine Reading Comprehension Evaluation

Submission date: Sept. 13, 2017, 11:13 a.m.

Machine Reading Comprehension (MRC) has become enormously popular recently and has attracted a lot of attentions. However, existing reading comprehension datasets are mostly in English. To add diversity in reading comprehension datasets, in this paper we propose a new Chinese reading comprehension dataset for accelerating related research in the community. The proposed dataset contains two different type: cloze-style reading comprehension and user query style reading comprehension, both associated with large-scale training data as well as human-generated validation and hidden test set. Along with this dataset, we also host the first Evaluation of Chinese Machine Reading Comprehension (CMRC-2017) and successfully attracted tens of participants, which suggest the potential impact of this dataset.

Creator(s)

iFLYTEK Research - Yiming Cui

Ting Liu

Zhipeng Chen

Shijin Wang

Guoping Hu

Wentao Ma

Distributor(s)

iFLYTEK Research - Yiming Cui

Right Holder(s)

iFLYTEK Research - Yiming Cui

Status : Accepted

ISLRN :

451-824-550-408-2

Version

1.1

Source

https://github.com/ymcui/cmrc2017

Resource Type

Primary Text

Media Type

Text

Language(s)

Chinese

Access Medium

Web Download