Pull requests: vllm-project/vllm


Pull requests list

[ROCm][Kernel] Using the correct warp_size value (labels: ready)
#12789 opened Feb 5, 2025 by gshtras
Improve TransformersModel UX
#12785 opened Feb 5, 2025 by hmellor
[NVIDIA] Support nvfp4 quantization (labels: ci/build)
#12784 opened Feb 5, 2025 by kaixih
[VLM] Update compatibility with transformers 4.49 (labels: documentation, ready)
#12781 opened Feb 5, 2025 by DarkLight1337
[Kernel] Make rotary_embedding ops more flexible with input shape (labels: ready)
#12777 opened Feb 5, 2025 by Isotr0py
2 tasks done
Use RMSNorm in TransformersModel (labels: ready)
#12776 opened Feb 5, 2025 by hmellor
[WIP][v1][Metrics] Add design doc (labels: documentation, v1)
#12745 opened Feb 4, 2025 by markmc (Draft)
Merge similar examples in offline_inference into single basic example (labels: documentation)
#12737 opened Feb 4, 2025 by hmellor
Update to torch==2.6.0 (labels: ci/build, ready)
#12721 opened Feb 4, 2025 by mgoin
Quantization and MoE configs for GH200 machines
#12717 opened Feb 4, 2025 by arvindsun