Sync master with upstream release b8164 (#437)
Merged
jan-service-account merged 8 commits into dev on Feb 27, 2026
Conversation
Co-authored-by: Neo Zhang Jianyu <jianyu.zhang@intel.com>
…org#19826)

* WIP: Add EuroBERT support with autoformatting changes

  This commit includes:
  - EuroBERT model implementation for GGUF conversion
  - C++ backend support for the EuroBERT architecture
  - unintended autoformatting changes to Python files

  Saving before reverting the formatting-only changes.

* feat: add back the eos assert when not using last-token pooling
* feat: removed duplicated code and cleaned up
* feat: removed non-working architectures and an unnecessary check
* fix: typo
* fix: dynamic pooling config
* feat: added an example model for EuroBERT
* feat: proper llama-vocab implementation for jina-v5
* fix: removed unnecessary comments
Co-authored-by: Roman Marchenko <r.marchenko@ideco.ru>
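For context on the pooling items in the EuroBERT commit above: embedding models reduce per-token hidden states to one vector, and last-token pooling only makes sense when the sequence ends with EOS. The sketch below is illustrative only, not the PR's code; the `pooling_type` names, the row-major `embd` layout, and the placement of the EOS assert are all assumptions.

```cpp
#include <cassert>
#include <vector>

// Pooling modes comparable in spirit to llama.cpp's pooling types
// (the names here are local to this sketch).
enum class pooling_type { none, mean, last };

// Reduce per-token embeddings (n_tokens rows of n_embd floats, row-major)
// to a single sequence embedding. `last_is_eos` mirrors the EOS check the
// commit message alludes to; its exact placement in the PR is an assumption.
std::vector<float> pool_embeddings(const std::vector<float> & embd,
                                   int n_tokens, int n_embd,
                                   pooling_type type, bool last_is_eos) {
    assert((int) embd.size() == n_tokens * n_embd && n_tokens > 0);
    std::vector<float> out(n_embd, 0.0f);
    switch (type) {
        case pooling_type::mean:
            // Average each embedding dimension over all tokens.
            for (int t = 0; t < n_tokens; ++t)
                for (int i = 0; i < n_embd; ++i)
                    out[i] += embd[t * n_embd + i] / n_tokens;
            break;
        case pooling_type::last:
            // Last-token pooling: the sequence must end with EOS for the
            // final hidden state to summarize the whole input.
            assert(last_is_eos && "last-token pooling expects a trailing EOS");
            for (int i = 0; i < n_embd; ++i)
                out[i] = embd[(n_tokens - 1) * n_embd + i];
            break;
        case pooling_type::none:
            // No pooling: callers consume the per-token embeddings directly;
            // return the first row just to keep the signature uniform.
            for (int i = 0; i < n_embd; ++i)
                out[i] = embd[i];
            break;
    }
    return out;
}
```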
* ggml-virtgpu-backend: validate the consistency of the received objects

  This patch adds consistency checks in the ggml-virtgpu-backend (running on the host side) to ensure that the data received from the guest is consistent (valid pointers, valid sizes and offsets).

* ggml-virtgpu-backend: add fallbacks/skips for optional ggml backend methods

  ```
  1. bck->iface.synchronize(bck)
  2. buft->iface.get_alloc_size(buft, op)
  3. buft->iface.get_max_size(buft)
  ```

  These three methods are optional in the GGML interface. `get_max_size` was already properly defaulted, but `backend synchronize` and `buft get_alloc_size` would have segfaulted the backend if not implemented.

* ggml-virtgpu-backend: fix log format missing argument
* ggml-virtgpu-backend: improve the abort message
* ggml-virtgpu-backend: more safety checks
* ggml-virtgpu-backend: new error code
* ggml-virtgpu-backend: initialize all the error codes
* ggml-virtgpu: add a missing comment generated by the code generator
* ggml-virtgpu: add the '[virtgpu]' prefix to the device/buffer names
* ggml-virtgpu: apir_device_buffer_from_ptr: improve the error message
* ggml-virtgpu: shared: make it match the latest api_remoting.h of Virglrenderer APIR (still unmerged)
* ggml-virtgpu: update the code generator to have dispatch_command_name in a host/guest shared file
* ggml-virtgpu: REMOTE_CALL: fail if the backend returns an error
* docs/backend/VirtGPU.md: indicate that the RAM+VRAM size is limited to 64 GB with libkrun
* ggml-virtgpu: turn off clang-format header ordering for some of the files

  Compilation breaks when the headers are ordered alphabetically.

* ggml-virtgpu: clang-format
* ggml-virtgpu/backend/shared/api_remoting: better comments for the APIR return codes
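The "fallbacks/skips" commit above describes a guard-before-call pattern: optional entry points in the ggml interface are function pointers that may be left NULL, so the caller must check before dispatching. Below is a minimal sketch of that pattern using simplified stand-in structs rather than the real `ggml_backend_i` / `ggml_backend_buffer_type_i` definitions; the default values are assumptions modeled on ggml's usual behavior (plain tensor byte size for `get_alloc_size`, `SIZE_MAX` for `get_max_size`).

```cpp
#include <cstddef>
#include <cstdint>

// --- simplified stand-ins for the real ggml interface structs ---
struct tensor_stub { size_t nbytes; };          // stands in for ggml_tensor

struct backend_stub;
struct backend_iface {
    void (*synchronize)(backend_stub * bck);    // optional: may be NULL
};
struct backend_stub { backend_iface iface; };

struct buft_stub;
struct buft_iface {
    size_t (*get_alloc_size)(buft_stub * buft, const tensor_stub * t); // optional
    size_t (*get_max_size)(buft_stub * buft);                          // optional
};
struct buft_stub { buft_iface iface; };

// Guard-before-call: skip the optional method instead of jumping through NULL.
void backend_synchronize(backend_stub * bck) {
    if (bck->iface.synchronize) {
        bck->iface.synchronize(bck);
    } // else: nothing to synchronize, treat as a no-op
}

// Default to the tensor's plain byte size when the buffer type does not
// override the allocation size.
size_t buft_get_alloc_size(buft_stub * buft, const tensor_stub * t) {
    if (buft->iface.get_alloc_size) {
        return buft->iface.get_alloc_size(buft, t);
    }
    return t->nbytes;
}

// Default to "no practical limit" when a max size is not reported.
size_t buft_get_max_size(buft_stub * buft) {
    if (buft->iface.get_max_size) {
        return buft->iface.get_max_size(buft);
    }
    return SIZE_MAX;
}
```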
* llama: add an option to merge the gate and exp weights
* Update convert_hf_to_gguf.py

  Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Update convert_hf_to_gguf.py

  Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* update constants.py
* add gate_up for all the MoE models
* convert: simplify the merge-tensor condition
* update constants.py
* reduce the number of models, add a create_tensor_gate_up helper

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
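For background on why merging helps: SwiGLU feed-forward layers apply a gate projection and an up projection to the same input, so storing them as one concatenated gate_up matrix lets a single matrix product produce both halves, which are then split and combined. A minimal self-contained sketch of the idea follows; the [2*n_ff, n_embd] layout with the gate half first is this sketch's assumption, not necessarily the PR's convention.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// SwiGLU feed-forward step using a merged gate+up weight matrix.
// w_gate_up is [2*n_ff, n_embd] row-major: the first n_ff rows are the gate
// projection, the next n_ff rows the up projection (layout assumed for this
// sketch). Merging means one matrix product feeds both branches.
std::vector<float> ffn_gate_up(const std::vector<float> & w_gate_up,
                               const std::vector<float> & x,
                               size_t n_embd, size_t n_ff) {
    assert(w_gate_up.size() == 2 * n_ff * n_embd);
    assert(x.size() == n_embd);

    // Single matmul over the merged matrix: y = W_gate_up * x (2*n_ff rows).
    std::vector<float> y(2 * n_ff, 0.0f);
    for (size_t r = 0; r < 2 * n_ff; ++r)
        for (size_t c = 0; c < n_embd; ++c)
            y[r] += w_gate_up[r * n_embd + c] * x[c];

    // Split the result into its gate and up halves and combine: silu(gate) * up.
    std::vector<float> out(n_ff);
    for (size_t i = 0; i < n_ff; ++i) {
        const float gate = y[i];            // first half: gate projection
        const float up   = y[n_ff + i];     // second half: up projection
        out[i] = (gate / (1.0f + std::exp(-gate))) * up;  // silu(gate) * up
    }
    return out;
}
```

On the conversion side this corresponds to concatenating the HF gate and up tensors along the output dimension into one GGUF tensor; the create_tensor_gate_up helper named in the commit presumably covers the loading side.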
Updates the dev branch with the latest release (b8164) from ggml-org/llama.cpp.