Skip to content

Latest commit

 

History

History
28 lines (22 loc) · 716 Bytes

File metadata and controls

28 lines (22 loc) · 716 Bytes

readme for People's Daily(人民日报) dataset

Task

Named Entity Recognition

Description

Tags: LOC(地名), ORG(机构名), PER(人名)
Tag Strategy:BIO
Split: 'space' (北 B-LOC)
Data Size:
Train data set ( example.train ):

句数 字符数 LOC数 ORG数 PER数
20864 979180 16571 9277 8144

Dev data set ( example.dev ):

句数 字符数 LOC数 ORG数 PER数
2318 109870 1951 984 884

Test data set ( example.test )

句数 字符数 LOC数 ORG数 PER数
4636 219197 3658 2185 1864

Reference:
https://github.com/zjy-ucas/ChineseNER