Replies: 5 comments 5 replies
-
I'm very interested in this topic since I agree that Semantic Chunking should be improved. During my research, I found that the logic behind the Late Chunking technique might solve many of the performance issues caused by traditional chunking methods. I have outlined a flow to manually implement the Late Chunking technique in Langchain using the features currently supported by Langchain (I haven't tested it yet, but I plan to do so soon and share the results later).
The flow I outlined above is just a concept I'm exploring right now. The main challenge is determining the optimal chunk size, so I'll be testing this soon and will share the results once I have them. I’d greatly appreciate any recommendations for improving my approach or suggestions for alternative methods that might work better! |
Beta Was this translation helpful? Give feedback.
-
Hey guys, same here, it would be nice to see implementation for late chunking, we tried our custom flow similar to what is already described here but it was really weak. Anyone has idea? Tnx |
Beta Was this translation helpful? Give feedback.
-
I'm in the same boat here. I'm especially interested in late-chunking in the context of addressing the "haystack in a haystack" problem that plagues scholarly RAGs. jina.ai gives an example notebook here and makes the code available here, but it's quite basic and custom and not readily compatible with typical RAG workflows using vector databases and so on. |
Beta Was this translation helpful? Give feedback.
-
is there any development with regard to topic? It's been a long time since the opening time of discussion. It will be vital for late chunking to be integrated into langchain system. Could you give us some information? |
Beta Was this translation helpful? Give feedback.
-
Have anyone implemented the code? |
Beta Was this translation helpful? Give feedback.
-
Checked
Feature request
It is an interesting new approach for chunking.
Motivation
The semantic chunking should be improved.
Proposal (If applicable)
No response
Beta Was this translation helpful? Give feedback.
All reactions