Issues: turboderp-org/exllamav2
Issues list
[BUG] When trying inference with Qwen2.5-VL-72B with Qwen2.5-VL-7B as a draft model, I get "IndexError: index out of range in self" (both models have identical vocab.json)
bug
Something isn't working
#733
opened Feb 6, 2025 by
Lissanro
3 tasks done
[BUG] Exception in ASGI application when trying inference with an image with Qwen2.5-VL-72B
bug
Something isn't working
#732
opened Feb 5, 2025 by
Lissanro
3 tasks done
[BUG] Mistral-Small-24B-Instruct-2501 - Tensor Parallel outputs garbled text.
bug
Something isn't working
#728
opened Jan 31, 2025 by
mindkrypted
3 tasks done
[BUG] 2080ti can't quant a 12B model
bug
Something isn't working
#725
opened Jan 28, 2025 by
frenzybiscuit
3 tasks done
[REQUEST] Support new SOTA vision model: Qwen 2.5 VL (3B, 7B, 72B)
#724
opened Jan 27, 2025 by
ThomasBaruzier
3 tasks done
[REQUEST] Support x-grammar structured output framework integration
#723
opened Jan 27, 2025 by
debasish-mihup
3 tasks done
[REQUEST] GraniteMoeForCausalLM architecture support
#722
opened Jan 27, 2025 by
cal066
3 tasks done
[REQUEST] Add support for Blackwell B200 with Cuda 12.8
#721
opened Jan 25, 2025 by
ofirkris
3 tasks done
[BUG] LORA fail load/inference for tensor parallel.
bug
Something isn't working
#719
opened Jan 22, 2025 by
Ph0rk0z
3 tasks done
[REQUEST] graceful fallback when gpu_split is misspecified
#711
opened Jan 4, 2025 by
jwlockhart
3 tasks done
[REQUEST] Sage Attention? Anyone tried it with exllama?
#702
opened Dec 21, 2024 by
Ph0rk0z
3 tasks done
[BUG] Qwen2.5-72B-2.xxbpw/Llama-70B-2.4bpw/even-4.xxbpw(maybe related to KV caching code) garbage output on some specific prompts.
bug
Something isn't working
#697
opened Dec 14, 2024 by
Originalimoc
3 tasks done
[BUG] lmformatenforcer integration seems to be broken on new versions
bug
Something isn't working
#696
opened Dec 11, 2024 by
hvico
3 tasks done
[BUG] ExLlamaV2DynamicGenerator class is not multiple threads supported
bug
Something isn't working
#690
opened Nov 29, 2024 by
UTSAV-44
3 tasks done
[BUG] generator.iterate() returns corrupted result objects in some cases
bug
Something isn't working
#689
opened Nov 29, 2024 by
p-e-w
3 tasks done