-
Notifications
You must be signed in to change notification settings - Fork 357
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Matching with Synonyms using KeyLLM OR KeyBERT #245
Comments
I think it should be possible if you use it as a candidate word. Have you tried it out? |
In this result the acronym and synonyms are not identified by KeyBERT
The code used
Is there some way to resolve this? |
Ah right, that's because the candidates should appear in the original document in order to find them. Instead, you might want to use the |
But do you know why its require the word itself to appear in the text? What I understood from the documentation is it uses embeddings and cosine similarity. Aint it enough to understand similar words or synonyms from the text and candidates? |
@ChettakattuA That depends on what you want. Generally, keywords are derived directly from the article that was written for SEO reasons. In KeyBERT Lines 163 to 182 in f0f96a6
|
I have been playing with KeyBERT and KeyLLM for a while now. And here is something I would like to achieve.
If have a text "CO2 emissions are high these days" and a list of candidate words, which might contain the word Carbondioxide and not CO2 will KeyBERT or KeyLLM find Carbondioxide as a match?
Text = "CO2 emissions are high these days"
candidate keyword list have the word ["Carbon dioxide"] and not "CO2"
Expected output = ["Carbon dioxide"]
The text was updated successfully, but these errors were encountered: