Bug: Server hangs when number of threads used for decoding > number of CPUs it runs on #10397
Labels
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
What happened?
As the title suggests, this will cause the server to hang,
while this will not
To reproduce, the client can simply be curl, as in the provided example. In the failing case, the client receives neither a response nor an error.
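Because the hang depends on the thread count exceeding the number of CPUs the server process is allowed to run on, it can help to confirm that number before reproducing. The following is a minimal sketch, assuming Linux and Python 3. The helper names `available_cpus` and `exceeds_cpus` are hypothetical and are not part of llama.cpp.

```python
import os


def available_cpus(pid: int = 0) -> int:
    """Return how many CPUs the process may be scheduled on (0 = this process).

    os.sched_getaffinity is Linux-specific. It reflects the process affinity
    mask (for example, one narrowed with taskset), whereas os.cpu_count()
    reports every logical CPU in the machine.
    """
    return len(os.sched_getaffinity(pid))


def exceeds_cpus(threads: int, cpus: int) -> bool:
    """True when the requested thread count is larger than the usable CPUs."""
    return threads > cpus


if __name__ == "__main__":
    cpus = available_cpus()
    print(f"CPUs usable by this process: {cpus}")
    # Compare a thread count equal to the CPU count with one just above it.
    for threads in (cpus, cpus + 1):
        state = "exceeds" if exceeds_cpus(threads, cpus) else "fits within"
        print(f"{threads} threads {state} {cpus} CPUs")
```

Running the script in the same environment as the server (same affinity mask or container) shows whether a given thread count falls into the condition described in the title. Passing the server's PID to `available_cpus` checks the running process instead.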
Name and Version
What operating system are you seeing the problem on?
Linux
Relevant log output