The “CockrACE” corpus consists of 140 news articles annotated with mentions of entities and their coreference links, as well as relation mentions for the evaluation of relation extraction (RE) experiments. Three semantic relations have been annotated, each of them dealing with people's family relationships (marriages, brother/sister, parent/child). This annotation effort is ongoing. We are providing here a snapshot of the (more-or-less) raw annotation. The files can be loaded with the Recon tool.
The DFKI CockrACE Corpus is released as CC-BY 4.0. If you use this data, you should cite the accompanying paper:
Language Resources and Annotation Tools for Cross-Sentence Relation Extraction. Sebastian Krause, Hong Li, Feiyu Xu, Hans Uszkoreit, Robert Hummel, and Luise Spielhagen. Proceedings of LREC, 2014. (bib) (pdf)