Skip to content

Pull requests: vllm-project/production-stack

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Bugfix][Router] use model_type instead of model_label
#705 opened Sep 19, 2025 by max-wittig Loading…
2 of 3 tasks
[Build] Update LMCache dependency to version 0.3.6
#701 opened Sep 17, 2025 by ikaadil Loading…
3 tasks
[Misc] bump up otel col version and use a simplified image
#698 opened Sep 17, 2025 by JaredTan95 Loading…
3 tasks done
[Bugfix] kv aware routing for lmcache 0.3.5
#697 opened Sep 15, 2025 by zerofishnoodles Loading…
3 tasks done
feat: allow for configuration of number of uvicorn workers
#689 opened Sep 9, 2025 by TheCodeWrangler Loading…
8 tasks done
[Bugfix] cache server yaml err
#677 opened Sep 4, 2025 by yyzxw Loading…
3 tasks done
[Feat][Router] Add TTFT Routing
#670 opened Sep 1, 2025 by chickeyton Draft
3 tasks
[Feat][PD] lastest PD support from LMCache with NIXL
#669 opened Aug 28, 2025 by kobe0938 Loading…
3 tasks
[Build] use uv with cache mount for faster docker builds
#657 opened Aug 23, 2025 by Hexoplon Loading…
3 tasks done
[Feat][Router] add ability to specify params to drop
#650 opened Aug 20, 2025 by max-wittig Loading…
3 tasks done
Add max_model_len field support to router
#638 opened Aug 11, 2025 by llm-net Loading…
[Feat] Router-side queuing support
#626 opened Aug 2, 2025 by allytotheson Loading…
[Feat] Implement workflow-aware routing for multi-agent AI workflows
#625 opened Aug 2, 2025 by hongsw Loading…
6 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.