docs : add MTP to GGUF Type slot by mishig25 · Pull Request #1488 · ggml-org/ggml

mishig25 · 2026-05-12T20:50:10Z

Adds MTP as a third value for the Type slot (alongside LoRA and vocab) to cover Multi-Token Prediction / speculative-decoding draft modules shipped beside a base model. Updates the regex in both spots and adds an example + test case

Adds `MTP` as a third value for the `Type` slot (alongside `LoRA` and `vocab`) to cover Multi-Token Prediction / speculative-decoding draft modules shipped beside a base model. Updates the validation regex in both the prose and JS copies, adds a filename example, and extends the Node.js test cases.

ngxson · 2026-05-13T15:33:25Z

 At a minimum all model files should have at least BaseName, SizeLabel, Version, in order to be easily validated as a file that is keeping with the GGUF Naming Convention. An example of this issue is that it is easy for Encoding to be mistaken as a FineTune if Version is omitted.

-To validate you can use this regular expression `^(?<BaseName>[A-Za-z0-9\s]*(?:(?:-(?:(?:[A-Za-z\s][A-Za-z0-9\s]*)|(?:[0-9\s]*)))*))-(?:(?<SizeLabel>(?:\d+x)?(?:\d+\.)?\d+[A-Za-z](?:-[A-Za-z]+(\d+\.)?\d+[A-Za-z]+)?)(?:-(?<FineTune>[A-Za-z0-9\s-]+))?)?-(?:(?<Version>v\d+(?:\.\d+)*))(?:-(?<Encoding>(?!LoRA|vocab)[\w_]+))?(?:-(?<Type>LoRA|vocab))?(?:-(?<Shard>\d{5}-of-\d{5}))?\.gguf$` which will check that you got the minimum BaseName, SizeLabel and Version present in the correct order.
+To validate you can use this regular expression `^(?<BaseName>[A-Za-z0-9\s]*(?:(?:-(?:(?:[A-Za-z\s][A-Za-z0-9\s]*)|(?:[0-9\s]*)))*))-(?:(?<SizeLabel>(?:\d+x)?(?:\d+\.)?\d+[A-Za-z](?:-[A-Za-z]+(\d+\.)?\d+[A-Za-z]+)?)(?:-(?<FineTune>[A-Za-z0-9\s-]+))?)?-(?:(?<Version>v\d+(?:\.\d+)*))(?:-(?<Encoding>(?!LoRA|vocab|MTP)[\w_]+))?(?:-(?<Type>LoRA|vocab|MTP))?(?:-(?<Shard>\d{5}-of-\d{5}))?\.gguf$` which will check that you got the minimum BaseName, SizeLabel and Version present in the correct order.


I think is might be incorrect, we went with the mtp- prefix in this PR: https://github.com/am17an/llama.cpp/pull/9/changes#diff-03b361169a690c5ac8e77460aeba18d833d2b78babab92b7b5bb721fc34947c9R612-R620

also mmproj- as prefix is pretty much a standard now, we should probably also add it to the regex

feel free to open a follow-up PR @mishig25

Opened #1496 : moves MTP out of Type and introduces a Sidecar prefix slot covering both mtp- and mmproj-. wdyt?

where as the unsloth ones don't encode MTP in the gguf filename

Unsloth-style (MTP in repo name, clean filename): the entire repo is dedicated to MTP variants, so MTP is implied by the repo name. Each file inside just needs to disambiguate by quant (Qwen3.6-27B-Q4_K_M.gguf, Qwen3.6-27B-Q5_K_M.gguf, etc.). See unsloth/Qwen3.6-27B-MTP-GGUF repo

yes, indeed if the model already come with MTP support, it's always better to have both main model + LLM in the same GGUF, it does save a bit of VRAM that way.

the case where MTP and main model are separate GGUFs is mostly for eagle3-style models

do you have an example repo for eagle3-style?

do you have an example repo for eagle3-style?

so far, every Eagle3 repo I found ships safetensors only (nvidia/gpt-oss-120b-Eagle3-v3, openbmb/MiniCPM4.1-8B-Eagle3, thoughtworks/Qwen3-8B-Eagle3)

* docs : add Sidecar prefix slot (mmproj, mtp); drop MTP from Type Introduces an optional Sidecar prefix slot at the front of the GGUF filename for auxiliary modules loaded alongside a base model: - mmproj: multimodal projector - mtp: Multi-Token Prediction draft module Removes MTP from the Type slot (added in #1488) so there is exactly one canonical position. Updates the regex (prose + JS), parse helper, filename examples, and Node.js test cases accordingly. * docs : clarify sidecar Parameter Count refers to main model * docs : address julien-c review (format-string consistency + mtp caveat)

mishig25 mentioned this pull request May 12, 2026

docs : add MTP to GGUF Type slot #1487

Closed

mishig25 force-pushed the gguf-naming-mtp branch from 9968a03 to 3ce3599 Compare May 12, 2026 20:50

ggerganov approved these changes May 13, 2026

View reviewed changes

ggerganov merged commit 5725fee into ggml-org:master May 13, 2026

This was referenced May 13, 2026

gguf: parser for percentage-mixed GGUF filenames huggingface/huggingface.js#2170

Draft

gguf: parser for GGUF filename variants (LoRA / vocab / MTP / imatrix) huggingface/huggingface.js#2171

Open

ngxson reviewed May 13, 2026

View reviewed changes

mishig25 mentioned this pull request May 18, 2026

docs : add Sidecar prefix slot (mmproj, mtp); drop MTP from Type #1496

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs : add MTP to GGUF Type slot#1488

docs : add MTP to GGUF Type slot#1488
ggerganov merged 1 commit into
ggml-org:masterfrom
mishig25:gguf-naming-mtp

mishig25 commented May 12, 2026 •

edited

Loading

Uh oh!

ngxson May 13, 2026

Uh oh!

ngxson May 13, 2026

Uh oh!

julien-c May 13, 2026

Uh oh!

ngxson May 13, 2026

Uh oh!

mishig25 May 18, 2026 •

edited

Loading

Uh oh!

julien-c May 19, 2026

Uh oh!

mishig25 May 19, 2026 •

edited

Loading

Uh oh!

ngxson May 19, 2026

Uh oh!

julien-c May 19, 2026

Uh oh!

mishig25 May 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mishig25 commented May 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mishig25 May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mishig25 May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mishig25 commented May 12, 2026 •

edited

Loading

mishig25 May 18, 2026 •

edited

Loading

mishig25 May 19, 2026 •

edited

Loading