-
Notifications
You must be signed in to change notification settings - Fork 119
Pull requests: llm-d/llm-d-kv-cache
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Support multiple gpu blocks in a single object store block
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#558
opened Apr 29, 2026 by
effi-ofer
Contributor
Loading…
tests: add scheduler-integration leaf module guarding kv-cache import surface
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#557
opened Apr 29, 2026 by
yankay
Collaborator
Loading…
update: remove VLLM_VERSION from makefile
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#556
opened Apr 29, 2026 by
zdtsw
Contributor
Loading…
Add redis lookup and improve nixl lookup
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#552
opened Apr 28, 2026 by
effi-ofer
Contributor
Loading…
deps(go): bump go.opentelemetry.io/otel from 1.39.0 to 1.41.0
dependencies
Pull requests that update a dependency file
size/S
Denotes a PR that changes 10-29 lines, ignoring generated files.
#541
opened Apr 24, 2026 by
dependabot
Bot
Loading…
feat: Add RenderBatchCompletion RPC for multi-prompt tokenization
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#538
opened Apr 24, 2026 by
albertoperdomo2
Contributor
Loading…
update: add make target to python lint + fix lint
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#536
opened Apr 23, 2026 by
zdtsw
Contributor
Loading…
Add Hybrid Multi-head Attention (HMA) support for KV-Cache scoring
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#533
opened Apr 19, 2026 by
kapiljain1989
Loading…
Removed Old Helm Setup
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#530
opened Apr 19, 2026 by
kapiljain1989
Loading…
deps(actions): bump softprops/action-gh-release from 2 to 3
dependencies
Pull requests that update a dependency file
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#516
opened Apr 14, 2026 by
dependabot
Bot
Loading…
Handling Attention Group id in KV events
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#510
opened Apr 10, 2026 by
kapiljain1989
Loading…
deps(go): bump go.opentelemetry.io/otel/sdk from 1.39.0 to 1.43.0
dependencies
Pull requests that update a dependency file
size/M
Denotes a PR that changes 30-99 lines, ignoring generated files.
#503
opened Apr 8, 2026 by
dependabot
Bot
Loading…
deps(actions): bump docker/setup-buildx-action from 3 to 4
dependencies
Pull requests that update a dependency file
lgtm
Looks good to me, indicates that a PR is ready to be merged.
lifecycle/stale
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#501
opened Apr 7, 2026 by
dependabot
Bot
Loading…
deps(actions): bump docker/build-push-action from 6 to 7
dependencies
Pull requests that update a dependency file
lgtm
Looks good to me, indicates that a PR is ready to be merged.
lifecycle/stale
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#500
opened Apr 7, 2026 by
dependabot
Bot
Loading…
feat: Add HMA support to FS connector
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#476
opened Mar 29, 2026 by
kfirtoledo
Collaborator
•
Draft
4 tasks done
test: Add test and usage example for mm requests
lifecycle/stale
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#453
opened Mar 23, 2026 by
sagearc
Collaborator
Loading…
deps(go): bump google.golang.org/grpc from 1.77.0 to 1.79.3
dependencies
Pull requests that update a dependency file
lifecycle/rotten
#438
opened Mar 19, 2026 by
dependabot
Bot
Loading…
feat:add support to invalidate KV cache via AllBlocksCleared event
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#437
opened Mar 18, 2026 by
yash9263
Loading…
deps(go): bump the go-dependencies group across 1 directory with 16 updates
dependencies
Pull requests that update a dependency file
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#430
opened Mar 17, 2026 by
dependabot
Bot
Loading…
feat: Add Hybrid Model Architecture (HMA) Support in Prefix-Cache Aware Scheduling
lifecycle/stale
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#427
opened Mar 16, 2026 by
kapiljain1989
Loading…
fix: Close data race in InMemoryIndex Add/Evict with RWMutex
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#422
opened Mar 16, 2026 by
gyliu513
Contributor
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-04-26.