4.0.0

@mgonzs13 released this 03 Oct 11:48
· 148 commits to main since this release
  • reranking added
  • LLM, embedding, and reranking models separated
  • new services (reranking and detokenize)
  • models for reranking and embeddings added
  • vicuna prompt added
  • llama namespace removed from LlamaClientNode
  • full demo with LLM + chat template + RAG + reranking + stream
  • README:
    • model shards example added
    • reranking langchain and demo added
    • embedding demo added
    • minor fixes
  • langchain reranking added
  • langchain upgraded to 0.3
  • llama.cpp b3870