-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][feat] KV cache-aware ADP router for prefix-affinity request routing
#12315
opened Mar 18, 2026 by
lancelly
Loading…
[#12230][fix] Add bounds checking in autotuner _find_nearest_profile for SM121
Community want to contribute
PRs initiated from Community
#12310
opened Mar 18, 2026 by
mihai-chiorean
Loading…
3 tasks done
[#11932][fix] Enable FP4 MoE dispatch for SM120/SM121 (DGX Spark)
Community want to contribute
PRs initiated from Community
#12309
opened Mar 18, 2026 by
mihai-chiorean
Loading…
4 tasks done
[None][fix] Fix KVCacheManagerV2 fallback, block index, and multimodal block reuse
#12306
opened Mar 18, 2026 by
yizhang-nv
Loading…
1 task done
[TRTLLM-10363][feat] Add instructions to run Cosmos Predict2.5 on DGX Spark
#12305
opened Mar 18, 2026 by
pamelap-nvidia
Loading…
1 task done
[TRTLLM-11544][feat] Add Qwen 3.5 supporting.
#12302
opened Mar 18, 2026 by
nv-guomingz
•
Draft
1 task done
[None][perf] Optimize KV cache for unified memory systems (DGX Spark)
Community want to contribute
PRs initiated from Community
#12301
opened Mar 18, 2026 by
mihai-chiorean
Loading…
6 tasks done
[None][test] Fix mpi-type issue and add wideep acc test to dev's l0 local flow
#12300
opened Mar 18, 2026 by
fredricz-20070104
Loading…
batch_manager: fix iterator UB in WindowBlockManager::getFreeBlock offload path
#12297
opened Mar 17, 2026 by
thorjohnsen
Loading…
3 tasks
[None][chore] Bump version to 1.3.0rc9
#12295
opened Mar 17, 2026 by
yuanjingx87
Loading…
1 task done
[#3237][fix] Support negative numbers in MajorityVote digit validation
Community want to contribute
PRs initiated from Community
#12294
opened Mar 17, 2026 by
nikJ13
Loading…
[None][fix] Grouping deltas within one streaming interval to reduce overhead
#12292
opened Mar 17, 2026 by
dongfengy
Loading…
1 task done
[#10607][feat] added AutoDeploy serving perf test with Super test
#12287
opened Mar 17, 2026 by
MrGeva
Loading…
1 task
[None][fix] Fix VLM guided decoding startup crash due to missing vocab_size_padded property
Community want to contribute
PRs initiated from Community
#12284
opened Mar 17, 2026 by
stefanpantic
Loading…
4 of 9 tasks
[https://nvbugs/5781383][chore] Unwaive test
#12282
opened Mar 17, 2026 by
shuyixiong
Loading…
1 task done
[https://nvbugs/5893116][fix] fix disagg llama oom
#12281
opened Mar 17, 2026 by
chuangz0
Loading…
1 task done
[TRTLLM-11551][feat] Support WideEP MoE backend for nemotron-h models
#12280
opened Mar 17, 2026 by
Wanli-Jiang
Loading…
1 task done
[https://nvbugs/5937478][test] Add RCCA test for DeepSeek-V3.2 multi-turn tool_call encoding
#12279
opened Mar 17, 2026 by
crazydemo
Loading…
1 task done
[None][test] Add DSA host cache offload tests to CI and QA test lists
#12278
opened Mar 17, 2026 by
longlee0622
Loading…
2 tasks done
[https://nvbugs/5389100][test] Remove TensorRT integration test list and add trtllm-serve for test_perf.py
#12277
opened Mar 17, 2026 by
yufeiwu-nv
Loading…
1 task done
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.