diff --git a/data/xml/2020.acl.xml b/data/xml/2020.acl.xml
index 2cde863a38..5eacc2812f 100644
--- a/data/xml/2020.acl.xml
+++ b/data/xml/2020.acl.xml
@@ -2496,6 +2496,7 @@
10.18653/v1/2020.acl-main.169
madaan-etal-2020-politeness
+ tag-and-generate/Politeness-Transfer-A-Tag-and-Generate-Approach
BPE-Dropout: Simple and Effective Subword Regularization
diff --git a/data/xml/2020.coling.xml b/data/xml/2020.coling.xml
index b82a7e044e..00157c5592 100644
--- a/data/xml/2020.coling.xml
+++ b/data/xml/2020.coling.xml
@@ -8167,6 +8167,7 @@
2020.coling-demos.2
10.18653/v1/2020.coling-demos.2
akhbardeh-etal-2020-maintnet
+ 10,000 People - Human Pose Recognition Data
DART: A Lightweight Quality-Suggestive Data-to-Text Annotation Tool
diff --git a/data/xml/2020.emnlp.xml b/data/xml/2020.emnlp.xml
index fe09c0df81..edd1dae00f 100644
--- a/data/xml/2020.emnlp.xml
+++ b/data/xml/2020.emnlp.xml
@@ -6088,6 +6088,7 @@
10.18653/v1/2020.emnlp-main.406
gao-gormley-2020-training
+ CoNLL-2003
Multilevel Text Alignment with Cross-Document Attention
diff --git a/data/xml/2021.acl.xml b/data/xml/2021.acl.xml
index 6d029c1a2c..09460771be 100644
--- a/data/xml/2021.acl.xml
+++ b/data/xml/2021.acl.xml
@@ -9168,7 +9168,7 @@
Outstanding Paper
10.18653/v1/2021.acl-long.568
aghajanyan-etal-2021-intrinsic
- rabeehk/compacter
+ rabeehk/compacter
ANLI
GLUE
MRPC
diff --git a/data/xml/2021.emnlp.xml b/data/xml/2021.emnlp.xml
index 97075299ff..e11d071b56 100644
--- a/data/xml/2021.emnlp.xml
+++ b/data/xml/2021.emnlp.xml
@@ -7500,6 +7500,7 @@
liu-etal-2021-effective
10.18653/v1/2021.emnlp-main.481
+ MIMIC-III
Contrastive Code Representation Learning
diff --git a/data/xml/2021.naacl.xml b/data/xml/2021.naacl.xml
index 7843555a3b..00f29ba602 100644
--- a/data/xml/2021.naacl.xml
+++ b/data/xml/2021.naacl.xml
@@ -1194,7 +1194,7 @@
10.18653/v1/2021.naacl-main.77
sun-etal-2021-lightningdot
- intersun/LightningDOT
+ intersun/LightningDOT
COCO
@@ -4168,6 +4168,7 @@
iida-etal-2021-tabbie
SFIG611/tabbie
+ VizNet-Sato
Better Feature Integration for Named Entity Recognition
diff --git a/data/xml/2022.acl.xml b/data/xml/2022.acl.xml
index 0d7b606a62..d7e3af9019 100644
--- a/data/xml/2022.acl.xml
+++ b/data/xml/2022.acl.xml
@@ -6536,6 +6536,7 @@ in the Case of Unambiguous Gender
CSQA
ComplexWebQuestions
MetaQA
+ SimpleQuestions
WebQuestions
@@ -7692,6 +7693,9 @@ in the Case of Unambiguous Gender
2022.acl-long.498.software.zip
zhou-etal-2022-distantly
10.18653/v1/2022.acl-long.498
+ kangISU/Conf-MPU-DS-NER
+ BC5CDR
+ CoNLL-2003
UniXcoder: Unified Cross-Modal Pre-training for Code Representation
diff --git a/data/xml/2022.fl4nlp.xml b/data/xml/2022.fl4nlp.xml
index d97c343de1..9a8168d6be 100644
--- a/data/xml/2022.fl4nlp.xml
+++ b/data/xml/2022.fl4nlp.xml
@@ -50,7 +50,6 @@
2022.fl4nlp-1.2
ro-etal-2022-scaling
10.18653/v1/2022.fl4nlp-1.2
- Billion Word Benchmark
Adaptive Differential Privacy for Language Model Training
diff --git a/data/xml/D14.xml b/data/xml/D14.xml
index d06aa1554a..3400c0fbe6 100644
--- a/data/xml/D14.xml
+++ b/data/xml/D14.xml
@@ -1716,6 +1716,7 @@
D14-1162
10.3115/v1/D14-1162
pennington-etal-2014-glove
+ stanfordnlp/GloVe
CoNLL-2003
diff --git a/data/xml/D18.xml b/data/xml/D18.xml
index 0c16259cfe..aa468bb36d 100644
--- a/data/xml/D18.xml
+++ b/data/xml/D18.xml
@@ -1515,6 +1515,7 @@
hayati-etal-2018-retrieval
sweetpeach/ReCode
Django
+ Hearthstone
SQL-to-Text Generation with Graph-to-Sequence Model
@@ -2115,6 +2116,7 @@
10.18653/v1/D18-1156
liu-etal-2018-jointly
lx865712528/JMEE
+ ACE 2005
RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information
@@ -2635,6 +2637,7 @@
iyer-etal-2018-mapping
sriniiyer/concode
CONCODE
+ Hearthstone
SyntaxSQLNet: Syntax Tree Networks for Complex and Cross-Domain Text-to-SQL Task
@@ -7733,6 +7736,7 @@
This paper describes SentencePiece, a language-independent subword tokenizer and detokenizer designed for Neural-based text processing, including Neural Machine Translation. It provides open-source C++ and Python implementations for subword units. While existing subword segmentation tools assume that the input is pre-tokenized into word sequences, SentencePiece can train subword models directly from raw sentences, which allows us to make a purely end-to-end and language independent system. We perform a validation experiment of NMT on English-Japanese machine translation, and find that it is possible to achieve comparable accuracy to direct subword training from raw sentences. We also compare the performance of subword training and segmentation with various configurations. SentencePiece is available under the Apache 2 license at https://github.com/google/sentencepiece.
10.18653/v1/D18-2012
kudo-richardson-2018-sentencepiece
+
CogCompTime: A Tool for Understanding Time in Natural Language
diff --git a/data/xml/I17.xml b/data/xml/I17.xml
index 6b23e7d944..7edd0b7bd1 100644
--- a/data/xml/I17.xml
+++ b/data/xml/I17.xml
@@ -1755,6 +1755,7 @@
miceli-barone-sennrich-2017-parallel
Avmb/code-docstring-corpus
Django
+ Hearthstone
Building Large Chinese Corpus for Spoken Dialogue Research in Specific Domains
diff --git a/data/xml/N19.xml b/data/xml/N19.xml
index b543084c11..001e220177 100644
--- a/data/xml/N19.xml
+++ b/data/xml/N19.xml
@@ -1719,6 +1719,7 @@
10.18653/v1/N19-1124
cai-etal-2019-skeleton
+ jcyk/skeleton-to-response
Jointly Optimizing Diversity and Relevance in Neural Response Generation
@@ -4737,6 +4738,7 @@
10.18653/v1/N19-1349
ko-etal-2019-linguistically
+ wjko2/Linguistically-Informed-Specificity-and-Semantic-Plausibility-for-Dialogue-Generation
CoLA
diff --git a/data/xml/P16.xml b/data/xml/P16.xml
index 4b6c14349d..d0df8d6d7e 100644
--- a/data/xml/P16.xml
+++ b/data/xml/P16.xml
@@ -640,6 +640,7 @@
10.18653/v1/P16-1057
ling-etal-2016-latent
deepmind/card2code
+ Hearthstone
Django