From 02f159e23bf1d81e5485826b5bf94a3b5e28037b Mon Sep 17 00:00:00 2001
From: Philip May
Date: Wed, 5 Jun 2019 13:37:13 +0200
Subject: [PATCH] Update README.md

---
 README.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/README.md b/README.md
index bf90261..bc12649 100644
--- a/README.md
+++ b/README.md
@@ -2,3 +2,5 @@
 This is a german text corpus from Wikipedia. It is cleaned and preprocessed and useful to train NLP embeddings for example.
 
 As Wikipedia itself this is published under [Creative Commons Attribution-ShareAlike 3.0 Unported license](https://de.wikipedia.org/wiki/Wikipedia:Lizenzbestimmungen_Creative_Commons_Attribution-ShareAlike_3.0_Unported).
+
+You can download the texts here: https://github.com/t-systems-on-site-services-gmbh/german-wikipedia-text-corpus/releases/tag/files_1