v0.3.4
- Make sure that when you use two lists that are exactly the same, it will return 1 for identical terms:
from polyfuzz import PolyFuzz
from_list = ["apple", "house"]
model = PolyFuzz("TF-IDF")
model.match(from_list, from_list)
This will match each word in from_list
to itself and give it a score of 1. Thus, apple
will be matched to apple
and
house
will be mapped to house
. However, if you input just a single list, it will try to map them within the list without
mapping to itself:
from polyfuzz import PolyFuzz
from_list = ["apple", "apples"]
model = PolyFuzz("TF-IDF")
model.match(from_list)
In the example above, apple
will be mapped to apples
and not to apple
. Here, we assume that the user wants to
find the most similar words within a list without mapping to itself.