-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRTLLM-12153][ci] Drop tensorrt_llm/llmapi/ from multi-GPU trigger list
#13993
opened May 11, 2026 by
QiJune
Collaborator
Loading…
1 task done
[None][chore] Update flashinfer-python from 0.6.10 to 0.6.11
#13992
opened May 11, 2026 by
yihwang-nv
Collaborator
Loading…
4 tasks
[None][infra] Waive 4 failed cases for main in post-merge
#13990
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[None][infra] Waive 20 failed cases for main in post-merge
#13989
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[None][test] Add DeepSeek V4 CI coverage
deepseek-v4
#13988
opened May 11, 2026 by
lfr-0531
Collaborator
Loading…
1 task done
[None][infra] Waive 7 failed cases for main in post-merge
#13987
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[None][infra] Waive 13 failed cases for main in post-merge
#13986
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[TRTLLM-12580][perf] ltx2: fused RMSNorm+RoPE across all attention paths + PE pre-shard
#13985
opened May 11, 2026 by
luyiyun1021
Collaborator
Loading…
1 task done
[None][infra] Waive 4 failed cases for main in post-merge
#13984
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[https://nvbugs/6162940][fix] Added a
SentencePieceTokenizer wrapper in examples/utils.py that drives `sen
#13983
opened May 11, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][infra] Waive 12 failed cases for main in post-merge
#13982
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[None][infra] Waive 19 failed cases for main in post-merge
#13981
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[None][infra] Waive 13 failed cases for main in post-merge
#13980
opened May 11, 2026 by
xinhe-nv
Collaborator
Loading…
[None][fix] Fix replay iter flag names in layer-wise benchmarks docs
#13979
opened May 11, 2026 by
kaiyux
Member
Loading…
3 tasks done
[None][feat] LTX2 Ulysses async A2A pipeline via NCCL window + LSA barrier
#13978
opened May 11, 2026 by
luyiyun1021
Collaborator
Loading…
1 task done
[None][perf] Enable in-flight batching for NanoV2VL multimodal encoder
#13977
opened May 11, 2026 by
yechank-nvidia
Collaborator
•
Draft
[https://nvbugs/6162853][chore] unwaive test
#13976
opened May 11, 2026 by
galagam
Collaborator
Loading…
1 task done
[None][perf] Add CUDA q_b norm for DeepSeek V4
deepseek-v4
#13975
opened May 11, 2026 by
mingyangHao
Collaborator
Loading…
1 task
[TRTLLM-12596][feat] Support simple logprob format
#13972
opened May 11, 2026 by
tongyuantongyu
Member
Loading…
1 task done
[None][perf] Follow-up patch for "Improve TRTLLM MoE autotune in DEP (#13667)"
#13971
opened May 11, 2026 by
rosenrodt
Collaborator
Loading…
1 task done
[None][fix] Fix misleading skills that use the -ccache option
#13970
opened May 11, 2026 by
yuantailing
Member
Loading…
1 task done
[None][fix] Fix bugs related with nemotron-nas model
#13968
opened May 11, 2026 by
Wanli-Jiang
Collaborator
Loading…
1 task done
[None][feat] Stack PRs for sweep perfing and accuracy checking
#13967
opened May 11, 2026 by
Wanli-Jiang
Collaborator
•
Draft
1 task done
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.