langchain_community: Checking text property first in neo4j to avoid duplicate nodes #17381

abhimalamkar · 2024-02-11T17:40:26Z

Checking text property first in neo4j to avoid duplicate nodes

@dev2049
@vowelparrot

vercel · 2024-02-11T17:40:31Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)	Visit Preview		Feb 14, 2024 7:00am

tomasonjo · 2024-02-12T07:56:59Z

Merging on long text properties is expensive and not good. If you want, you can change the id calculation by using a cheap hash function

…ps with parent or child nodes

tomasonjo · 2024-02-12T22:57:28Z

All you need to do is to change this one line: https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/vectorstores/neo4j_vector.py#L429

No need to introduce new params or anything

abhimalamkar · 2024-02-13T00:04:42Z

@tomasonjo This diff now also handles creation of relationships to parent and child nodes.

tomasonjo · 2024-02-13T07:02:34Z

Not really, this is way too specific and not general to be merged in

abhimalamkar · 2024-02-13T19:48:01Z

@tomasonjo This does not break any existing functionality but allow the ability to create relationships with other exiting nodes while creating new docs.

No worries, I reverted my changes to just introduce text hash as ids

baskaryan · 2024-03-29T00:53:46Z

believe this was resolved in #18846, let me know if i'm missing something!

checking text property first in neo4j to avoid duplicate nodes

62076c3

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Feb 11, 2024

dosubot bot added Ɑ: vector store Related to vector store module 🤖:improvement Medium size change to existing code to handle new use-cases labels Feb 11, 2024

adding hash function as id to not have duplicates. adding relationshi…

c12017d

…ps with parent or child nodes

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Feb 12, 2024

abhimalamkar added 3 commits February 12, 2024 17:37

Merge branch 'master' into master

281e987

updated

63ce9df

Merge branch 'master' of https://github.com/abhimalamkar/langchain

f18e519

fixing params

e380ada

reverting changes

e6d059a

dosubot bot added size:XS This PR changes 0-9 lines, ignoring generated files. and removed size:M This PR changes 30-99 lines, ignoring generated files. labels Feb 13, 2024

abhimalamkar added 2 commits February 13, 2024 14:55

Merge branch 'master' into master

4fd37ac

Merge branch 'master' into master

d19a4b1

baskaryan closed this Mar 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

langchain_community: Checking text property first in neo4j to avoid duplicate nodes #17381

langchain_community: Checking text property first in neo4j to avoid duplicate nodes #17381

abhimalamkar commented Feb 11, 2024 •

edited

Loading

vercel bot commented Feb 11, 2024 •

edited

Loading

tomasonjo commented Feb 12, 2024 •

edited

Loading

tomasonjo commented Feb 12, 2024

abhimalamkar commented Feb 13, 2024 •

edited

Loading

tomasonjo commented Feb 13, 2024

abhimalamkar commented Feb 13, 2024 •

edited

Loading

baskaryan commented Mar 29, 2024

langchain_community: Checking text property first in neo4j to avoid duplicate nodes #17381

langchain_community: Checking text property first in neo4j to avoid duplicate nodes #17381

Conversation

abhimalamkar commented Feb 11, 2024 • edited Loading

vercel bot commented Feb 11, 2024 • edited Loading

tomasonjo commented Feb 12, 2024 • edited Loading

tomasonjo commented Feb 12, 2024

abhimalamkar commented Feb 13, 2024 • edited Loading

tomasonjo commented Feb 13, 2024

abhimalamkar commented Feb 13, 2024 • edited Loading

baskaryan commented Mar 29, 2024

abhimalamkar commented Feb 11, 2024 •

edited

Loading

vercel bot commented Feb 11, 2024 •

edited

Loading

tomasonjo commented Feb 12, 2024 •

edited

Loading

abhimalamkar commented Feb 13, 2024 •

edited

Loading

abhimalamkar commented Feb 13, 2024 •

edited

Loading