-
Notifications
You must be signed in to change notification settings - Fork 11.4k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
sync : ggml
ggml
changes relating to the ggml tensor library for machine learning
script
Script related
#12935
opened Apr 14, 2025 by
ggerganov
Loading…
CUDA/HIP: Share the same unified memory allocation logic.
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12934
opened Apr 14, 2025 by
hjc4869
Loading…
vulkan: enable coopmat2 FA gqa and split_k optimizations more often
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#12931
opened Apr 13, 2025 by
jeffbolznv
Loading…
gguf-py : GGUF Editor GUI - Python + Qt
python
python script changes
#12930
opened Apr 13, 2025 by
christopherthompson81
Loading…
llava: add performance print for gemma3 example
examples
#12929
opened Apr 13, 2025 by
Russyyds
Loading…
feat: Add Clear All Conversations for llama-server web-ui
examples
server
#12924
opened Apr 12, 2025 by
characharm
Loading…
SYCL: Fix im2col
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12910
opened Apr 12, 2025 by
qnixsynapse
Loading…
mtmd : add methods to access
mtmd_image_tokens
examples
#12906
opened Apr 11, 2025 by
ngxson
Loading…
Get CPU model in ggml_backend_cpu_device_context on FreeBSD
ggml
changes relating to the ggml tensor library for machine learning
#12902
opened Apr 11, 2025 by
yurivict
Loading…
cuda: fix compilation error (#12893)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12894
opened Apr 11, 2025 by
lizhenneng
Loading…
SYCL: Add ROPE vision kernel
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12887
opened Apr 11, 2025 by
qnixsynapse
Loading…
opencl: split changes relating to the ggml tensor library for machine learning
ggml-opencl.cl
into multiple files and cleanup
ggml
#12886
opened Apr 11, 2025 by
lhez
Loading…
[CANN]feat: Increase the way memory allocation is managed
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#12875
opened Apr 10, 2025 by
bachelor-dou
•
Draft
llama-bench: enhance benchmark with improved token throughput measurements
examples
#12874
opened Apr 10, 2025 by
thevishalagarwal
Loading…
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX
ggml
changes relating to the ggml tensor library for machine learning
#12871
opened Apr 10, 2025 by
slaren
Loading…
opencl: fix incorrect local_size index in profiling log
ggml
changes relating to the ggml tensor library for machine learning
#12868
opened Apr 10, 2025 by
kimminsu38oo
Loading…
[CANN]Opt ROPE optimization
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#12865
opened Apr 10, 2025 by
noemotiovon
Loading…
CANN: add async task submit
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12858
opened Apr 10, 2025 by
Alcpz
Loading…
2 of 3 tasks
gguf-py: byteswapping improvements
python
python script changes
#12851
opened Apr 9, 2025 by
AlekseiNikiforovIBM
Loading…
metal : add memory pool for temp allocs
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
Llama-3_1-Nemotron-Ultra-253B-v1 support
python
python script changes
#12843
opened Apr 9, 2025 by
ymcki
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.