Skip to content

[cuda] Compact int4/int6 weight quant metadata (bf16 -> uint8 + per-row super-scale)#20571

Open
Gasoonjia wants to merge 4 commits into
mainfrom
cuda-int4-int6-metadata-opt
Open

[cuda] Compact int4/int6 weight quant metadata (bf16 -> uint8 + per-row super-scale)#20571
Gasoonjia wants to merge 4 commits into
mainfrom
cuda-int4-int6-metadata-opt

Commits

Commits on Jun 28, 2026

Commits on Jun 30, 2026

Commits on Jul 1, 2026