gguf-py: byteswapping improvements #12851

AlekseiNikiforovIBM · 2025-04-09T12:10:27Z

Implement byteswapping for Q4_0

This is needed to byteswap Mistral model.

Also restore original shapes after byteswapping tensors.
It is not needed at the moment, but do it in case they'd be used in future.

Rework byteswapping code in gguf-py: move out details from byteswapping tensor blocks code.

This is needed to byteswap Mistral model. Also restore original shapes after byteswapping tensors. It is not needed at the moment, but do it in case they'd be used in future.

Move out details from byteswapping tensor blocks code

compilade · 2025-04-09T14:13:40Z

I think it could be useful to add a byteswap method for all types in gguf-py/gguf/quants.py.

That would also be useful for GGUFWriter (and maybe also GGUFReader).

Then gguf_convert_endian.py could eventually simply be a read-write round trip with the desired endian set for GGUFWriter (while GGUFReader auto-detects the endianness of the source file)

gguf-py: implement byteswapping for Q4_0

1c63021

This is needed to byteswap Mistral model. Also restore original shapes after byteswapping tensors. It is not needed at the moment, but do it in case they'd be used in future.

github-actions bot added the python python script changes label Apr 9, 2025

Rework byteswapping code in gguf-py

ce893e1

Move out details from byteswapping tensor blocks code

AlekseiNikiforovIBM force-pushed the byteswapping_2 branch from 4ae4108 to ce893e1 Compare April 9, 2025 12:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gguf-py: byteswapping improvements #12851

gguf-py: byteswapping improvements #12851

AlekseiNikiforovIBM commented Apr 9, 2025

compilade commented Apr 9, 2025

gguf-py: byteswapping improvements #12851

Are you sure you want to change the base?

gguf-py: byteswapping improvements #12851

Conversation

AlekseiNikiforovIBM commented Apr 9, 2025

compilade commented Apr 9, 2025