Skip to content

Commit

Permalink
new reranking and embeddings
Browse files Browse the repository at this point in the history
  • Loading branch information
mgonzs13 committed Jan 10, 2025
1 parent 1292f11 commit 46f3f05
Show file tree
Hide file tree
Showing 3 changed files with 19 additions and 4 deletions.
9 changes: 9 additions & 0 deletions llama_bringup/models/All-MiniLM-L6-v2.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
n_ctx: 2048
n_batch: 1024
n_gpu_layers: 0
n_threads: 1
n_predict: 2048
embedding: true

model_repo: "second-state/All-MiniLM-L6-v2-Embedding-GGUF"
model_filename: "all-MiniLM-L6-v2-ggml-model-f16.gguf"
9 changes: 9 additions & 0 deletions llama_bringup/models/bge-reranker-v2-m3.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
n_ctx: 2048
n_batch: 1024
n_gpu_layers: 0
n_threads: -1
n_predict: 2048
reranking: true

model_repo: "gpustack/bge-reranker-v2-m3-GGUF"
model_filename: "bge-reranker-v2-m3-Q4_K_M.gguf"
5 changes: 1 addition & 4 deletions llama_demos/llama_demos/chatllama_tools_demo_node.py
Original file line number Diff line number Diff line change
Expand Up @@ -66,10 +66,7 @@ def send_prompt(self) -> None:
]

self.get_logger().info(f"\nPrompt: {messages[0].content}")

llm_tools = self.chat.bind_tools(
[get_inhabitants, get_curr_temperature], tool_choice="any"
)
llm_tools = self.chat.bind_tools([get_inhabitants, get_curr_temperature])

self.initial_time = time.time()
all_tools_res = llm_tools.invoke(messages)
Expand Down

0 comments on commit 46f3f05

Please sign in to comment.