You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Affects all llama builds since e0dbec0, tested up to
version: 4941 (ba932df)
built with cc (Ubuntu 13.3.0-6ubuntu2-24.04) 13.3.0 for x86_64-linux-gnu
bug not present in
version: 4879 (f08f4b3)
built with cc (Ubuntu 13.3.0-6ubuntu2-24.04) 13.3.0 for x86_64-linux-gnu
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
libllama (core library)
Command line
# Can be replicated with any model, here using Llama-3.3# (-b/-c to reduce memory usages, but not relevant to the bug - can use model ctx size)
llama-embedding -m Llama-3.3-70B-Instruct-Q6_K-00001-of-00002.gguf -ngl 90 -b 2048 -c 2048 -p 'hello, world' --pooling mean
Problem description & steps to reproduce
Fails in llm_graph_context::build_pooling with:
llama.cpp/ggml/src/ggml.c:2738: GGML_ASSERT(ggml_can_mul_mat(a, b)) failed
Reproduce with any model using llama-embedding --pooling mean, for example:
Name and Version
Affects all llama builds since e0dbec0, tested up to
version: 4941 (ba932df)
built with cc (Ubuntu 13.3.0-6ubuntu2-24.04) 13.3.0 for x86_64-linux-gnu
bug not present in
version: 4879 (f08f4b3)
built with cc (Ubuntu 13.3.0-6ubuntu2-24.04) 13.3.0 for x86_64-linux-gnu
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
libllama (core library)
Command line
Problem description & steps to reproduce
Fails in
llm_graph_context::build_pooling
with:llama.cpp/ggml/src/ggml.c:2738: GGML_ASSERT(ggml_can_mul_mat(a, b)) failed
Reproduce with any model using
llama-embedding --pooling mean
, for example:The error is due to mismatch between
inp
andinp_mean
tensors in llama-graph.cpp@:1626.Run with additional output printing
nelements
andnrows
ofinp
andinp_mean
:run before with llama 4879 (f08f4b3), i.e., before e0dbec0 (#12181):
First Bad Commit
e0dbec0
Relevant log output
The text was updated successfully, but these errors were encountered: