Pull requests: ggml-org/llama.cpp

Pull requests list

server: add model alias presets
Labels: examples, python, server
#14083 opened Jun 9, 2025 by am17an
kv-cache : fix shift
#14081 opened Jun 9, 2025 by ggerganov (Draft)
Implement GGML_CPU_ALL_VARIANTS for ARM
Labels: ggml
#14080 opened Jun 9, 2025 by ckastner
graph : fix geglu
#14077 opened Jun 9, 2025 by ggerganov
rpc: nicer error message for RPC server crash
Labels: ggml
#14076 opened Jun 9, 2025 by isaac-mcfadyen
llama: automatically set runtime parameters such as --n-gpu-layers to fit VRAM
Labels: ggml
#14067 opened Jun 8, 2025 by JohannesGaessler (Draft)
vulkan : fix build failure caused by vulkan-shaders-gen install
Labels: ggml, Vulkan
#14047 opened Jun 6, 2025 by AsbjornOlling
ggml-cpu: optimise assembly calls for hsum on s390x
#14037 opened Jun 5, 2025 by taronaeo
llama : add thread safety test
Labels: devops, ggml, testing
#14035 opened Jun 5, 2025 by slaren
sycl: Adding additional cpy dbg print output
Labels: ggml, SYCL
#14034 opened Jun 5, 2025 by ShanoToni
cuda : fix device sync on buffer clear
Labels: ggml, Nvidia GPU
#14033 opened Jun 5, 2025 by slaren
cpu: Update RISC-V condition to require GCC version 14 or higher
Labels: ggml
#14032 opened Jun 5, 2025 by Ghosts381937
llama : support qwen3 rerank and embeddings
Labels: examples, python, server
#14029 opened Jun 5, 2025 by ngxson
ggml-cpu: fix uncaught underscore terminators for s390x
Labels: ggml
#14023 opened Jun 5, 2025 by taronaeo
tests : add test-tokenizers-repo
Labels: testing
#14017 opened Jun 4, 2025 by CISC
llama: Attempt to add ModernBert
Labels: python
#14014 opened Jun 4, 2025 by huydt84
llama-chat : Do not throw when tool parsing fails
#14012 opened Jun 4, 2025 by p1-0tr
opencl: preliminary support for Q4_0 mul_mat_id using matvec
Labels: ggml
#14003 opened Jun 4, 2025 by lhez
[CANN]:Replace aclrtMemsetSync with InplaceZero operator for zero tensor creation
Labels: Ascend NPU, ggml
#14002 opened Jun 4, 2025 by luyhcsu
llama : allow building all tests on windows when not using shared libs
Labels: devops, testing
#13980 opened Jun 2, 2025 by slaren
Hybrid recurrent cache
#13979 opened Jun 2, 2025 by gabe-l-hart