From e123c5c4fd075cac17b0db38e8c1516c9c2186d1 Mon Sep 17 00:00:00 2001 From: Philip May Date: Tue, 4 Jun 2019 08:59:12 +0200 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index dfedac4..bf90261 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,4 @@ # German Wikipedia Text Corpus -This is a german text corpus from Wikipedia. It is useful to train NLP embeddings for example. +This is a german text corpus from Wikipedia. It is cleaned and preprocessed and useful to train NLP embeddings for example. As Wikipedia itself this is published under [Creative Commons Attribution-ShareAlike 3.0 Unported license](https://de.wikipedia.org/wiki/Wikipedia:Lizenzbestimmungen_Creative_Commons_Attribution-ShareAlike_3.0_Unported).