Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
PhilipMay authored Jun 5, 2019
1 parent e123c5c commit 02f159e
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,3 +2,5 @@
This is a german text corpus from Wikipedia. It is cleaned and preprocessed and useful to train NLP embeddings for example.

As Wikipedia itself this is published under [Creative Commons Attribution-ShareAlike 3.0 Unported license](https://de.wikipedia.org/wiki/Wikipedia:Lizenzbestimmungen_Creative_Commons_Attribution-ShareAlike_3.0_Unported).

You can download the texts here: https://github.com/t-systems-on-site-services-gmbh/german-wikipedia-text-corpus/releases/tag/files_1

0 comments on commit 02f159e

Please sign in to comment.