NVIDIA / TensorRT-LLM Public

Pull requests: NVIDIA/TensorRT-LLM

584 Open 7,941 Closed

Pull requests list

[None][feat] KV cache-aware ADP router for prefix-affinity request routing

#12315 opened Mar 18, 2026 by lancelly

[NVBug 5969206][fix] Force-terminate in-flight KV transfers on abort

#12314 opened Mar 18, 2026 by nv-yna • Draft

4 tasks

[NVBug 5969206][fix] Break circular dependency in disagg KV transfer timeout cleanup

#12313 opened Mar 18, 2026 by nv-yna • Draft

3 tasks

[None][docs] update tech blog listings

#12312 opened Mar 18, 2026 by bobboli

1 task

[#12230][fix] Add bounds checking in autotuner _find_nearest_profile for SM121

Label: Community want to contribute (PRs initiated from Community)

#12310 opened Mar 18, 2026 by mihai-chiorean

3 tasks done

[#11932][fix] Enable FP4 MoE dispatch for SM120/SM121 (DGX Spark)

Label: Community want to contribute (PRs initiated from Community)

#12309 opened Mar 18, 2026 by mihai-chiorean

4 tasks done

[None][fix] Fix KVCacheManagerV2 fallback, block index, and multimodal block reuse

#12306 opened Mar 18, 2026 by yizhang-nv

1 task done

[TRTLLM-10363][feat] Add instructions to run Cosmos Predict2.5 on DGX Spark

#12305 opened Mar 18, 2026 by pamelap-nvidia

1 task done

[TRTLLM-11544][feat] Add Qwen 3.5 supporting.

#12302 opened Mar 18, 2026 by nv-guomingz • Draft

1 task done

[None][perf] Optimize KV cache for unified memory systems (DGX Spark)

Label: Community want to contribute (PRs initiated from Community)

#12301 opened Mar 18, 2026 by mihai-chiorean

6 tasks done

[None][test] Fix mpi-type issue and add wideep acc test to dev's l0 local flow

#12300 opened Mar 18, 2026 by fredricz-20070104

batch_manager: fix iterator UB in WindowBlockManager::getFreeBlock offload path

#12297 opened Mar 17, 2026 by thorjohnsen

3 tasks

[None][infra] Update CI allowedlist

#12296 opened Mar 17, 2026 by yuanjingx87

1 task done

[None][chore] Bump version to 1.3.0rc9

#12295 opened Mar 17, 2026 by yuanjingx87

1 task done

[#3237][fix] Support negative numbers in MajorityVote digit validation

Label: Community want to contribute (PRs initiated from Community)

#12294 opened Mar 17, 2026 by nikJ13

[None][fix] Grouping deltas within one streaming interval to reduce overhead

#12292 opened Mar 17, 2026 by dongfengy

1 task done

[#10607][feat] added AutoDeploy serving perf test with Super test

#12287 opened Mar 17, 2026 by MrGeva

1 task

[#11526][chore] AutoDeploy accuracy tests: Use Llama3.1-8B-Instruct official checkpoints

#12285 opened Mar 17, 2026 by galagam • Draft

1 task done

[None][fix] Fix VLM guided decoding startup crash due to missing vocab_size_padded property

Label: Community want to contribute (PRs initiated from Community)

#12284 opened Mar 17, 2026 by stefanpantic

4 of 9 tasks

[https://nvbugs/5781383][chore] Unwaive test

#12282 opened Mar 17, 2026 by shuyixiong

1 task done

[https://nvbugs/5893116][fix] fix disagg llama oom

#12281 opened Mar 17, 2026 by chuangz0

1 task done

[TRTLLM-11551][feat] Support WideEP MoE backend for nemotron-h models

#12280 opened Mar 17, 2026 by Wanli-Jiang

1 task done

[https://nvbugs/5937478][test] Add RCCA test for DeepSeek-V3.2 multi-turn tool_call encoding

#12279 opened Mar 17, 2026 by crazydemo

1 task done

[None][test] Add DSA host cache offload tests to CI and QA test lists

#12278 opened Mar 17, 2026 by longlee0622

2 tasks done

[https://nvbugs/5389100][test] Remove TensorRT integration test list and add trtllm-serve for test_perf.py

#12277 opened Mar 17, 2026 by yufeiwu-nv

1 task done
