-
Notifications
You must be signed in to change notification settings - Fork 2k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][doc] Update Qwen2/3-VL's model on supported_models.md
Multimodal
Label for issues & PRs regarding Multimodal related objects
#10797
opened Jan 19, 2026 by
yechank-nvidia
Loading…
[https://nvbugs/5814253][fix] unwaive test_autotuner_distributed_strategy tests
#10793
opened Jan 19, 2026 by
hyukn
Loading…
1 task done
[None][chore] switch to ConfigurableMoE as the default path
#10792
opened Jan 19, 2026 by
xxi-nv
Loading…
1 task done
[TRTLLM-10398][feat] Enable TRTLLM moe backend for Nemotron Super
#10791
opened Jan 19, 2026 by
nv-guomingz
•
Draft
1 task
[https://nvbugs/5814247][fix] unwaive AutoDeploy multi-gpu unit tests
#10789
opened Jan 19, 2026 by
hyukn
Loading…
1 task done
Draft: Support context sparse attention for mqa/gqa
#10788
opened Jan 19, 2026 by
heyuhhh
Loading…
1 task
[TRTLLM-10785][feat] Fix sharding dashboard errors
#10786
opened Jan 18, 2026 by
greg-kwasniewski1
Loading…
1 task done
[None][feat] Initial patch for trtllm-gen attention backend
#10784
opened Jan 18, 2026 by
yihwang-nv
Loading…
[TRTLLM-9581][Infra] Use /home/scratch.trt_llm_data_ci in computelab
#10783
opened Jan 18, 2026 by
ZhanruiSunCh
Loading…
1 task
[None][fix] AutoDeploy: Support per-expert input scales for FP8 MoE
#10778
opened Jan 18, 2026 by
galagam
Loading…
1 task done
[TRTLLM-10774][feat] fix sharding tests
#10775
opened Jan 17, 2026 by
greg-kwasniewski1
Loading…
1 task done
[DRAFT][AutoDeploy] Enhance memory consumption for moe fusion transform
#10772
opened Jan 17, 2026 by
taylor-yb-lee
•
Draft
1 task
[TRTLLM-10325][feat] Refactor speculative decoding workers
#10768
opened Jan 16, 2026 by
cascade812
Loading…
1 task done
[#9525][feat] add L2 norm pattern matcher and fusion transform
#10767
opened Jan 16, 2026 by
karthikvetrivel
Loading…
1 task done
[https://nvbugs/5814914][fix] Fix llama sm120 spec dec
#10765
opened Jan 16, 2026 by
mikeiovine
Loading…
1 task done
[https://nvbugs/5748664][fix] Increasing disagg acc test timeout
#10764
opened Jan 16, 2026 by
pcastonguay
Loading…
1 task done
[None][test] Update test case for release
#10763
opened Jan 16, 2026 by
crazydemo
Loading…
1 task done
[None][fix] Fix waived tests for Nemotron-h models
#10758
opened Jan 16, 2026 by
Wanli-Jiang
•
Draft
1 task done
[TRTLLM-10453][feat] Update mamba decode kernel to flashinfer
#10757
opened Jan 16, 2026 by
Wanli-Jiang
•
Draft
1 task
[https://nvbugs/5814203][fix] Fix port 8000 being used issue in stress test.
#10756
opened Jan 16, 2026 by
dominicshanshan
Loading…
1 task done
[None] [fix] Remove unnecessary ValueError
#10755
opened Jan 16, 2026 by
kaiyux
Loading…
1 task done
Previous Next
ProTip!
Updated in the last three days: updated:>2026-01-15.