Quantization information #308

fabrizio-ottati · 2025-02-13T13:48:44Z

Hi! I could not find instructions on how to retrieve the quantization information once the model has been scaled, specifically the scales and zero points for weights and activations. I am using a W4A8 scheme. Sorry for the naive question :)

gushiqiao · 2025-02-17T11:26:12Z

llmc/llmc/compression/quantization/module_utils.py, line 475 in 50b0da7:

class FakeQuantLinear(nn.Module):
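
For context, the scale and zero point asked about above can be illustrated with a generic min-max uniform quantizer. The sketch below is plain Python and is not taken from llmc: the function names `minmax_qparams` and `fake_quantize` are hypothetical, and the actual `FakeQuantLinear` module may compute or store its parameters differently (for example per channel or per group). It assumes per-tensor quantization with 4 bits for weights and 8 bits for activations, as in a W4A8 scheme.

```python
def minmax_qparams(values, n_bits, symmetric=False):
    """Return (scale, zero_point, qmin, qmax) from the min/max of `values`.

    Generic textbook formulation; an assumption, not llmc's implementation.
    """
    lo, hi = min(values), max(values)
    if symmetric:
        # Signed integer grid centred on zero, so the zero point is 0.
        qmax = 2 ** (n_bits - 1) - 1
        qmin = -qmax - 1
        max_abs = max(abs(lo), abs(hi))
        scale = max_abs / qmax if max_abs > 0 else 1.0
        return scale, 0, qmin, qmax
    # Unsigned integer grid; widen the range so 0.0 is exactly representable.
    qmin, qmax = 0, 2 ** n_bits - 1
    lo, hi = min(lo, 0.0), max(hi, 0.0)
    scale = (hi - lo) / (qmax - qmin) if hi > lo else 1.0
    zero_point = max(qmin, min(qmax, round(-lo / scale)))
    return scale, zero_point, qmin, qmax


def fake_quantize(values, scale, zero_point, qmin, qmax):
    """Quantize to the integer grid, clamp, then map back to floats."""
    out = []
    for v in values:
        q = max(qmin, min(qmax, round(v / scale) + zero_point))
        out.append((q - zero_point) * scale)
    return out


# Illustrative W4A8 setup: 4-bit weights, 8-bit activations (made-up values).
weights = [-1.0, -0.3, 0.0, 0.4, 2.0]
activations = [0.0, 0.5, 1.0, 2.55]

w_scale, w_zp, w_qmin, w_qmax = minmax_qparams(weights, n_bits=4)
a_scale, a_zp, a_qmin, a_qmax = minmax_qparams(activations, n_bits=8)

w_dq = fake_quantize(weights, w_scale, w_zp, w_qmin, w_qmax)
print("weight scale / zero point:", w_scale, w_zp)
print("activation scale / zero point:", a_scale, a_zp)
```

In this formulation a real value `x` maps to the integer `clamp(round(x / scale) + zero_point, qmin, qmax)`, and the dequantized value is `(q - zero_point) * scale`, so the rounding error for in-range values is at most half a scale step.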
