
Are there any quantized models for Qwen2.5-VL? #719

Open
aabbccddwasd opened this issue Feb 4, 2025 · 4 comments

Comments

@aabbccddwasd

The FP16 version is impossible to run on a local GPU. I hope there will be GPTQ and AWQ versions, please!!!
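While the official GPTQ/AWQ checkpoints are not out, one common interim workaround is on-the-fly 4-bit loading with bitsandbytes. This is only a minimal sketch, assuming a transformers build that already ships the Qwen2.5-VL classes (e.g. `Qwen2_5_VLForConditionalGeneration`) and using the public `Qwen/Qwen2.5-VL-7B-Instruct` checkpoint; the quantization settings are illustrative:

```python
# Interim workaround: load the public BF16 checkpoint with on-the-fly 4-bit
# quantization (bitsandbytes) so the weights fit on a single consumer GPU.
# Assumes transformers includes Qwen2_5_VLForConditionalGeneration and that
# accelerate + bitsandbytes are installed.
import torch
from transformers import (
    AutoProcessor,
    BitsAndBytesConfig,
    Qwen2_5_VLForConditionalGeneration,
)

model_id = "Qwen/Qwen2.5-VL-7B-Instruct"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # NF4 weight-only quantization
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bf16 for speed/accuracy
)

model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(model_id)
```

Weight-only 4-bit loading is not the same as an official GPTQ/AWQ release, but it usually gets a 7B-scale VLM onto a single consumer GPU until the quantized checkpoints ship.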

@ShuaiBai623
Collaborator

The quantized models for Qwen2.5-VL are coming soon.

@aabbccddwasd
Author

> The quantized models for Qwen2.5-VL are coming soon.

Thanks!! How soon, one week or one month?

@devops724

Please add vLLM support for Qwen2.5-VL too.

@aabbccddwasd
Author

> Please add vLLM support for Qwen2.5-VL too.

Well, I think the vLLM developers are responsible for this. They said v0.7.2 will support Qwen2.5-VL.
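Once a vLLM release with Qwen2.5-VL support lands (v0.7.2 per the comment above), running it should follow the usual vLLM pattern. A minimal sketch using the offline Python API; the model ID, context length, and sampling settings are assumptions to illustrate the call shape:

```python
# Minimal sketch of running Qwen2.5-VL through vLLM's offline Python API,
# assuming a vLLM release (v0.7.2+ per the comment above) that supports the
# Qwen2.5-VL architecture. Settings below are illustrative, not prescriptive.
from vllm import LLM, SamplingParams

llm = LLM(
    model="Qwen/Qwen2.5-VL-7B-Instruct",
    max_model_len=8192,  # assumed context budget; lower it if GPU memory is tight
)

sampling = SamplingParams(temperature=0.2, max_tokens=256)
outputs = llm.generate(
    ["Describe what a vision-language model does in one sentence."],
    sampling,
)
print(outputs[0].outputs[0].text)

# The OpenAI-compatible server should work the same way once support lands:
#   vllm serve Qwen/Qwen2.5-VL-7B-Instruct
```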
