- Content
The following resource is the TermITH-Eval dataset presented at LREC in May 2016. Each folder contains 100 documents annotated by the keyphrase extraction methods -- TF-IDF, KEA and TopicRank -- and manually evaluated by professional indexers (folder 'corpus_sciencesInfo' contains only 99 documents).
- License
CC By 4.0 (http://creativecommons.org/licenses/by/4.0/)
The Creative Commons Attribution 4.0 International License applies to this resource. Any re-use of this resource should attribute its content to 'Corpus TermITH-Eval created in the framework of the TermITH-project (ANR-12-CORD-0029) under the responsibility of INIST-CNRS, LINA, INRIA'.