You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hopefully you agree that a single word form can be transformed into 1+ baseforms. This is the main idea of my initial post: if no PoS information is available, it is reasonable to assume any PoS and produce all possible base forms. Here you are an example of two different lemmata having the same derived forms:
leaves leaf
leaves leave
If the left column is supposed to contain unique words only, how will multiple outcomes be given? Like this:
Zuschlage Zuschlag,zuschlagen
It is also possible to accomplish such merging at load/compile time. This way it is a little bit easier for the the users who may want to update the resource.
Situation: The baseform resource
de-lemma-utf8.txt
defines various outcomes for one input word, for example,I would expect that all outcomes will be returned, as the correct baseform depends on the part of speech.
If the resource is used case-insensitively, the number of such collisions will increase, now comprising cases like:
Would it be possible to fix the plugin to return all entries given in the resource?
Thanx
The text was updated successfully, but these errors were encountered: