Skip to content

v0.3.4

Compare
Choose a tag to compare
@MaartenGr MaartenGr released this 05 Nov 10:41
· 13 commits to master since this release
241d7d3
  • Make sure that when you use two lists that are exactly the same, it will return 1 for identical terms:
from polyfuzz import PolyFuzz
from_list = ["apple", "house"]
model = PolyFuzz("TF-IDF")
model.match(from_list, from_list)

This will match each word in from_list to itself and give it a score of 1. Thus, apple will be matched to apple and
house will be mapped to house. However, if you input just a single list, it will try to map them within the list without
mapping to itself:

from polyfuzz import PolyFuzz
from_list = ["apple", "apples"]
model = PolyFuzz("TF-IDF")
model.match(from_list)

In the example above, apple will be mapped to apples and not to apple. Here, we assume that the user wants to
find the most similar words within a list without mapping to itself.