Skip to content

Latest commit

 

History

History
33 lines (26 loc) · 1003 Bytes

README.md

File metadata and controls

33 lines (26 loc) · 1003 Bytes

MOSS

Evaulation Data used within "Sentence Compression for Arbitrary Languages via Multilingual Pivoting".

Data set

MOSS is a parallel corpus containing documents from the European parliament proceedings, TED talks, news commentaries, and the EU bookshop. Each document is written in English, French, and German, and compressed by native speakers of the respective language who process a document at a time. We obtain five compressions per document.

Citation

@InProceedings{D18-1267,
  author = 	"Mallinson, Jonathan
		and Sennrich, Rico
		and Lapata, Mirella",
  title = 	"Sentence Compression for Arbitrary Languages via Multilingual Pivoting",
  booktitle = 	"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing",
  year = 	"2018",
  publisher = 	"Association for Computational Linguistics",
  pages = 	"2453--2464",
  location = 	"Brussels, Belgium",
  url = 	"http://aclweb.org/anthology/D18-1267"
}