-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
chore: Use ellipsis as default value to detect whether residual argument is provided
#3626
opened Apr 16, 2025 by
yuxianq
Loading…
Fix [NVBUG 5219533 5220758] Fix script options for engines.
#3622
opened Apr 16, 2025 by
Tracin
Loading…
Fix: [NVBUG 5201530]Fix merge dummy request when DP + Overlap + Cuda graph.
#3621
opened Apr 16, 2025 by
Tracin
Loading…
[Draft]test: add INTEGRATION_TEST env var to speed up integration test
#3618
opened Apr 16, 2025 by
crazydemo
Loading…
Updating the run.py to make the draft target model run with the LLaMa 3 1B/8B
#3615
opened Apr 16, 2025 by
mayani-nv
Loading…
feat: Add support for smaller hidden_dim in AR fusion kernel
#3609
opened Apr 16, 2025 by
yilin-void
Loading…
feat: [AutoDeploy] generalizing cudagraph to multiple dynamic inputs
#3589
opened Apr 16, 2025 by
lucaslie
Loading…
feat: Integrate GPUDirect Storage (GDS) into Executor API
#3582
opened Apr 15, 2025 by
DomBrown
Loading…
fix : release torch-managed memory as soon as it's not needed
#3579
opened Apr 15, 2025 by
peaceh-nv
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.