-
Notifications
You must be signed in to change notification settings - Fork 11.4k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: Add Clear All Conversations for llama-server web-ui
examples
server
#12924
opened Apr 12, 2025 by
characharm
Loading…
SYCL: Fix im2col
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12910
opened Apr 12, 2025 by
qnixsynapse
Loading…
mtmd : add methods to access
mtmd_image_tokens
examples
#12906
opened Apr 11, 2025 by
ngxson
Loading…
Get CPU model in ggml_backend_cpu_device_context on FreeBSD
ggml
changes relating to the ggml tensor library for machine learning
#12902
opened Apr 11, 2025 by
yurivict
Loading…
cuda: fix compilation error (#12893)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12894
opened Apr 11, 2025 by
lizhenneng
Loading…
ggml: disable CUDA graphs for unsupported DUP and CONT node types
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12891
opened Apr 11, 2025 by
agray3
Loading…
SYCL: Add ROPE vision kernel
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12887
opened Apr 11, 2025 by
qnixsynapse
Loading…
opencl: split changes relating to the ggml tensor library for machine learning
ggml-opencl.cl
into multiple files and cleanup
ggml
#12886
opened Apr 11, 2025 by
lhez
Loading…
[CANN]feat: Increase the way memory allocation is managed
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#12875
opened Apr 10, 2025 by
bachelor-dou
•
Draft
llama-bench: enhance benchmark with improved token throughput measurements
examples
#12874
opened Apr 10, 2025 by
thevishalagarwal
Loading…
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX
ggml
changes relating to the ggml tensor library for machine learning
#12871
opened Apr 10, 2025 by
slaren
Loading…
opencl: fix incorrect local_size index in profiling log
ggml
changes relating to the ggml tensor library for machine learning
#12868
opened Apr 10, 2025 by
kimminsu38oo
Loading…
[CANN]Opt ROPE optimization
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#12865
opened Apr 10, 2025 by
noemotiovon
Loading…
CANN: add async task submit
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12858
opened Apr 10, 2025 by
Alcpz
Loading…
2 of 3 tasks
gguf-py: byteswapping improvements
python
python script changes
#12851
opened Apr 9, 2025 by
AlekseiNikiforovIBM
Loading…
metal : add memory pool for temp allocs
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Llama-3_1-Nemotron-Ultra-253B-v1 support
python
python script changes
#12843
opened Apr 9, 2025 by
ymcki
Loading…
convert : write tensors in parallel
performance
Speed related topics
python
python script changes
#12837
opened Apr 8, 2025 by
compilade
Loading…
3 of 6 tasks
llamax : add a possible implementation of a simple API for llama.cpp …
build
Compilation issues
#12835
opened Apr 8, 2025 by
cyrilleberger
Loading…
Add AVX512 implementation of GEMM - Q4_Kx8
ggml
changes relating to the ggml tensor library for machine learning
#12829
opened Apr 8, 2025 by
Srihari-mcw
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.