Skip to content

Pull requests: llm-d/llm-d-kv-cache

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Support multiple gpu blocks in a single object store block size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#558 opened Apr 29, 2026 by effi-ofer Contributor Loading…
tests: add scheduler-integration leaf module guarding kv-cache import surface size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#557 opened Apr 29, 2026 by yankay Collaborator Loading…
update: remove VLLM_VERSION from makefile size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#556 opened Apr 29, 2026 by zdtsw Contributor Loading…
Fix: UDS Tokenizer render error lgtm Looks good to me, indicates that a PR is ready to be merged. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#554 opened Apr 29, 2026 by yyzxw Loading…
Add redis lookup and improve nixl lookup size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#552 opened Apr 28, 2026 by effi-ofer Contributor Loading…
deps(go): bump go.opentelemetry.io/otel from 1.39.0 to 1.41.0 dependencies Pull requests that update a dependency file size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
#541 opened Apr 24, 2026 by dependabot Bot Loading…
feat: Add RenderBatchCompletion RPC for multi-prompt tokenization size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#538 opened Apr 24, 2026 by albertoperdomo2 Contributor Loading…
update: add make target to python lint + fix lint size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#536 opened Apr 23, 2026 by zdtsw Contributor Loading…
fix(e2e): fix container image build platform for macOS/non-Linux hosts lgtm Looks good to me, indicates that a PR is ready to be merged. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
#535 opened Apr 23, 2026 by gyliu513 Contributor Loading…
Add Hybrid Multi-head Attention (HMA) support for KV-Cache scoring size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#533 opened Apr 19, 2026 by kapiljain1989 Loading…
Removed Old Helm Setup size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#530 opened Apr 19, 2026 by kapiljain1989 Loading…
deps(actions): bump softprops/action-gh-release from 2 to 3 dependencies Pull requests that update a dependency file size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#516 opened Apr 14, 2026 by dependabot Bot Loading…
Handling Attention Group id in KV events size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#510 opened Apr 10, 2026 by kapiljain1989 Loading…
deps(go): bump go.opentelemetry.io/otel/sdk from 1.39.0 to 1.43.0 dependencies Pull requests that update a dependency file size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#503 opened Apr 8, 2026 by dependabot Bot Loading…
deps(actions): bump docker/setup-buildx-action from 3 to 4 dependencies Pull requests that update a dependency file lgtm Looks good to me, indicates that a PR is ready to be merged. lifecycle/stale size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#501 opened Apr 7, 2026 by dependabot Bot Loading…
deps(actions): bump docker/build-push-action from 6 to 7 dependencies Pull requests that update a dependency file lgtm Looks good to me, indicates that a PR is ready to be merged. lifecycle/stale size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#500 opened Apr 7, 2026 by dependabot Bot Loading…
feat: Add HMA support to FS connector size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#476 opened Mar 29, 2026 by kfirtoledo Collaborator Draft
4 tasks done
test: Add test and usage example for mm requests lifecycle/stale size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#453 opened Mar 23, 2026 by sagearc Collaborator Loading…
fix lint errors lifecycle/stale
#446 opened Mar 23, 2026 by roytman Contributor Loading…
deps(go): bump google.golang.org/grpc from 1.77.0 to 1.79.3 dependencies Pull requests that update a dependency file lifecycle/rotten
#438 opened Mar 19, 2026 by dependabot Bot Loading…
feat:add support to invalidate KV cache via AllBlocksCleared event size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#437 opened Mar 18, 2026 by yash9263 Loading…
deps(go): bump the go-dependencies group across 1 directory with 16 updates dependencies Pull requests that update a dependency file size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#430 opened Mar 17, 2026 by dependabot Bot Loading…
feat: Add Hybrid Model Architecture (HMA) Support in Prefix-Cache Aware Scheduling lifecycle/stale size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#427 opened Mar 16, 2026 by kapiljain1989 Loading…
fix: Close data race in InMemoryIndex Add/Evict with RWMutex size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#422 opened Mar 16, 2026 by gyliu513 Contributor Loading…
ProTip! Updated in the last three days: updated:>2026-04-26.