Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
Changed "linguistic resources" to "lemmatization rules", since CSTlemma has options to use additional linguistic resources.
  • Loading branch information
BartJongejan authored Jan 11, 2022
1 parent 75b31f5 commit d031411
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# tinylemmatizer
Simple lemmatizer that uses the same linguistic resources as CSTlemma
Simple lemmatizer that uses the same lemmatization rules as CSTlemma.

This project consists of a Python3 wrapper around a small C-program that lemmatizes full forms using a rule set ('flexrules') that is in the same binary format as the rule sets used by cstlemma (https://github.com/kuhumcst/cstlemma).
Rule sets can be trained using the affixtrain proram (https://github.com/kuhumcst/affixtrain). Alternatively, rule sets can be downloaded from https://github.com/kuhumcst/texton-linguistic-resources. Look for files in folders such as
Expand Down

0 comments on commit d031411

Please sign in to comment.