Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

llama-bench : Add --override-tensors arg examples
#12922 opened Apr 12, 2025 by 4onen Loading…
SYCL: Fix im2col ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12910 opened Apr 12, 2025 by qnixsynapse Loading…
Get CPU model in ggml_backend_cpu_device_context on FreeBSD ggml changes relating to the ggml tensor library for machine learning
#12902 opened Apr 11, 2025 by yurivict Loading…
cuda: fix compilation error (#12893) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12894 opened Apr 11, 2025 by lizhenneng Loading…
ggml: disable CUDA graphs for unsupported DUP and CONT node types ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#12891 opened Apr 11, 2025 by agray3 Loading…
llama-tts : input from stdin examples
#12890 opened Apr 11, 2025 by marcoStocchi Loading…
SYCL: Add ROPE vision kernel ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12887 opened Apr 11, 2025 by qnixsynapse Loading…
opencl: split ggml-opencl.cl into multiple files and cleanup ggml changes relating to the ggml tensor library for machine learning
#12886 opened Apr 11, 2025 by lhez Loading…
[CANN]feat: Increase the way memory allocation is managed Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#12875 opened Apr 10, 2025 by bachelor-dou Draft
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX ggml changes relating to the ggml tensor library for machine learning
#12871 opened Apr 10, 2025 by slaren Loading…
opencl: fix incorrect local_size index in profiling log ggml changes relating to the ggml tensor library for machine learning
#12868 opened Apr 10, 2025 by kimminsu38oo Loading…
[CANN]Opt ROPE optimization Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#12865 opened Apr 10, 2025 by noemotiovon Loading…
CANN: add async task submit Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#12864 opened Apr 10, 2025 by hipudding Draft
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12858 opened Apr 10, 2025 by Alcpz Loading…
2 of 3 tasks
gguf-py: byteswapping improvements python python script changes
#12851 opened Apr 9, 2025 by AlekseiNikiforovIBM Loading…
metal : add memory pool for temp allocs Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#12850 opened Apr 9, 2025 by ggerganov Draft
6 of 8 tasks
Llama-3_1-Nemotron-Ultra-253B-v1 support python python script changes
#12843 opened Apr 9, 2025 by ymcki Loading…
convert : write tensors in parallel performance Speed related topics python python script changes
#12837 opened Apr 8, 2025 by compilade Loading…
3 of 6 tasks
Add AVX512 implementation of GEMM - Q4_Kx8 ggml changes relating to the ggml tensor library for machine learning
#12829 opened Apr 8, 2025 by Srihari-mcw Loading…
common: add partial regex support examples server testing Everything test related
#12808 opened Apr 7, 2025 by ochafik Loading…
ProTip! Follow long discussions with comments:>50.