sync : llama.cpp #1476

Merged
ggerganov merged 10 commits into master from sync-llama.cpp-26-05-05
May 5, 2026

@ggerganov

Member

No description provided.

shawngu-quic and others added 10 commits May 5, 2026 10:30
* MoE Mxfp4 CLC kernel added, router reorder on GPU

* Pass test-backend-ops for MoE mxfp4 Adreno CLC

* remove putenv in llama-model.cpp

* fix indent style and whitespace

* opencl: remove unnecessary headers

* opencl: do not save cl_program objects

* opencl: remove unnecessary assert

* fix precision issue

---------

Co-authored-by: Li He <lih@qti.qualcomm.com>
…irely) (llama/22533)

* fix: CUDA device PCI bus ID detection for multi-GPU de-dupe

* HIP, MUSA macros

---------

Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
* shader(norm): add layer norm ops

* shader(norm): stabilize floating point computation with Kahan summation and handle mixed types

* shader(norm): remove the non-contiguous strides

* shader(norm): use the original implementation rather than the Kahan summation
* llama : add option to save memory in device buffers

* tests : extend llama-save-load-state
@ggerganov

Member Author

@taronaeo The python deps are failing to install on the self-hosted runner:

https://github.com/ggml-org/ggml/actions/runs/25366621707/job/74379012178#step:3:155

Not sure how to fix it. Any ideas?

ggerganov merged commit 5bb7236 into master on May 5, 2026
26 of 32 checks passed
ggerganov deleted the sync-llama.cpp-26-05-05 branch on May 5, 2026 10:13
taronaeo commented May 5, 2026

Member

> @taronaeo The python deps are failing to install on the self-hosted runner:
>
> https://github.com/ggml-org/ggml/actions/runs/25366621707/job/74379012178#step:3:155
>
> Not sure how to fix it. Any ideas?

This requirement, torch~=2.5.1, and this one, torchvision>=0.15.2, are mismatched. The latest version of torchvision deprecated support for torch 2.5.X, and since we force torch to stay within 2.5.X, pip can't find a suitable version to install.

I would suggest that we update the version requirements to align with llama.cpp (i.e., bump torch to torch~=2.6.0). I would also suggest applying supply-chain hardening, as we did in llama.cpp, since we currently accept any torchvision version from 0.15.2 upward.

This problem is isolated to the GGML repository only.
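
To illustrate the two specifiers discussed above, here is a minimal sketch of how `~=` (compatible release) and `>=` behave under PEP 440. It is a simplification that handles only plain numeric releases such as `2.5.1`; the function names are hypothetical, and the candidate version lists are illustrative strings, not data pulled from PyPI. It does not model which torchvision release supports which torch release.

```python
def parse(version: str) -> tuple:
    """Parse a plain dotted release such as '2.5.1' into (2, 5, 1)."""
    return tuple(int(part) for part in version.split("."))


def pad(a: tuple, b: tuple) -> tuple:
    """Zero-pad two release tuples to the same length, as PEP 440 does."""
    n = max(len(a), len(b))
    return a + (0,) * (n - len(a)), b + (0,) * (n - len(b))


def at_least(candidate: str, minimum: str) -> bool:
    """Simplified `>=minimum`: any release at or above the lower bound."""
    cand, low = pad(parse(candidate), parse(minimum))
    return cand >= low


def compatible_release(candidate: str, base: str) -> bool:
    """Simplified `~=base`, which is equivalent to `>=base, ==prefix.*`.

    For base '2.5.1' the prefix is (2, 5), so only 2.5.x releases
    at or above 2.5.1 match.
    """
    cand, ref = pad(parse(candidate), parse(base))
    prefix = ref[:-1]  # drop the last component of the base release
    return cand >= ref and cand[: len(prefix)] == prefix


# Illustrative candidates only (assumption: not fetched from an index).
torch_candidates = ["2.5.0", "2.5.1", "2.5.2", "2.6.0"]
allowed_torch = [v for v in torch_candidates if compatible_release(v, "2.5.1")]

torchvision_candidates = ["0.15.1", "0.15.2", "0.99.0"]
allowed_torchvision = [v for v in torchvision_candidates if at_least(v, "0.15.2")]

print("torch~=2.5.1 accepts:", allowed_torch)
print("torchvision>=0.15.2 accepts:", allowed_torchvision)
```

The point of the sketch is the asymmetry: `torch~=2.5.1` confines torch to the 2.5 series, while `torchvision>=0.15.2` has no upper bound, so it admits any later release, including ones that may not work with the pinned torch series.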

10 participants