Skip to content

docs : add MTP to GGUF Type slot#1487

Closed
mishig25 wants to merge 3 commits into
ggml-org:masterfrom
mishig25:gguf-naming-mtp-and-imatrix-note
Closed

docs : add MTP to GGUF Type slot#1487
mishig25 wants to merge 3 commits into
ggml-org:masterfrom
mishig25:gguf-naming-mtp-and-imatrix-note

Conversation

@mishig25

@mishig25 mishig25 commented May 12, 2026

Copy link
Copy Markdown
Contributor

Adds MTP as a third value for the Type slot (alongside LoRA and vocab) to cover Multi-Token Prediction / speculative-decoding draft modules shipped beside a base model. Updates the regex in both spots and adds an example + test case.

@ggerganov previously OK'd adding MTP.

Open question: the spec puts Type after Encoding, so the canonical form here is …-Q4_K_M-MTP.gguf. Some community releases place MTP before the encoding (…-MTP-Q4_K_M.gguf). Happy to flip if you'd rather match current filenames.

mishig25 added 2 commits May 12, 2026 22:46
Adds `MTP` as a third value for the `Type` slot, alongside `LoRA` and
`vocab`, to cover Multi-Token Prediction / speculative-decoding draft
modules shipped beside a base model. Updates the validation regex,
adds a parse example, and extends the Node.js test cases.

Also adds a short non-normative note that `IQ*` quants do not imply
imatrix calibration and that the `i1-` prefix used by some publishers
(e.g. mradermacher) is a community convention, not part of the spec.
Calibration provenance should live in metadata, not the filename.
@mishig25 mishig25 changed the title docs : add MTP to GGUF Type slot; note i1- as non-normative docs : add MTP to GGUF Type slot May 12, 2026
@mishig25 mishig25 closed this May 12, 2026
@mishig25 mishig25 deleted the gguf-naming-mtp-and-imatrix-note branch May 12, 2026 20:49
@mishig25

Copy link
Copy Markdown
Contributor Author

Closed by an unintentional branch rename on my fork — superseded by #1488 with identical changes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant