..
_torch
[None][feat] Use Separate QKV Input Layout for Context MLA ( #6538 )
2025-08-19 22:04:48 +08:00
api_stability
[None][opt] Add batch wait timeout in fetching requests ( #6923 )
2025-08-19 03:50:08 -04:00
bindings
[TRTLLM-6881][feat] Include attention dp rank info with KV cache events ( #6563 )
2025-08-07 14:17:07 +02:00
disaggregated
feat: Dynamically remove servers in PD ( #5270 )
2025-06-25 09:50:04 +08:00
llmapi
[ https://nvbugs/5451296 ][bug] Cherry-pick #7017 from release/1.0 branch ( #7043 )
2025-08-19 11:25:05 -04:00
others
[TRTLLM-5966][feat] Helix: extend mapping to support different CP types ( #6816 )
2025-08-14 09:00:02 -07:00
scaffolding
[ https://nvbugs/5387375 ] fix(scaffolding): fix scaffolding aime test in test_e2e ( #6140 )
2025-07-18 10:34:37 +08:00
tools
enh: Add script to map tests <-> jenkins stages & vice-versa ( #5177 )
2025-07-19 00:50:40 +08:00
trt
[TRTLLM-5863][feat] Support MoE INT8 Weight-Only-Quantization in PyTorch Workflow ( #6629 )
2025-08-15 17:15:49 -04:00
utils
[None][fix] Migrate to new cuda binding package name ( #6700 )
2025-08-07 16:29:55 -04:00
conftest.py
[TRTLLM-5508][feat] check input tokens + improve error handling ( #5170 )
2025-08-05 18:27:43 +01:00
dump_checkpoint_stats.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
gc_utils.py
[nvbug 5273941] fix: broken cyclic reference detect ( #5417 )
2025-07-01 20:12:55 +08:00
profile_utils.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
pytest.ini
[ci] parallelize torch unittests ( #5714 )
2025-07-09 11:05:57 +03:00
test_model_runner_cpp.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
test_pip_install.py
[TRTLLM-7141][infra] Use repo mirrors to avoid intermittent network failures ( #6836 )
2025-08-15 11:16:07 +08:00