From d031411788c0a00a32ca6e51e115f1fc398d7c1e Mon Sep 17 00:00:00 2001 From: Bart Jongejan Date: Tue, 11 Jan 2022 08:51:32 +0100 Subject: [PATCH] Update README.md Changed "linguistic resources" to "lemmatization rules", since CSTlemma has options to use additional linguistic resources. --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 69802fb..40861ad 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,5 @@ # tinylemmatizer -Simple lemmatizer that uses the same linguistic resources as CSTlemma +Simple lemmatizer that uses the same lemmatization rules as CSTlemma. This project consists of a Python3 wrapper around a small C-program that lemmatizes full forms using a rule set ('flexrules') that is in the same binary format as the rule sets used by cstlemma (https://github.com/kuhumcst/cstlemma). Rule sets can be trained using the affixtrain proram (https://github.com/kuhumcst/affixtrain). Alternatively, rule sets can be downloaded from https://github.com/kuhumcst/texton-linguistic-resources. Look for files in folders such as