| .. |
|
_torch
|
[https://nvbugs/5458874][fix] Fix Nemotron-H flaky CUDA graph / overlap scheduler test (#6996)
|
2025-08-19 15:45:06 +03:00 |
|
api_stability
|
[None][opt] Add batch wait timeout in fetching requests (#6923)
|
2025-08-19 03:50:08 -04:00 |
|
bindings
|
[TRTLLM-6881][feat] Include attention dp rank info with KV cache events (#6563)
|
2025-08-07 14:17:07 +02:00 |
|
disaggregated
|
feat: Dynamically remove servers in PD (#5270)
|
2025-06-25 09:50:04 +08:00 |
|
llmapi
|
[https://nvbugs/5371480][fix] Enable test_phi3_small_8k (#6938)
|
2025-08-19 09:42:35 +08:00 |
|
others
|
[TRTLLM-5966][feat] Helix: extend mapping to support different CP types (#6816)
|
2025-08-14 09:00:02 -07:00 |
|
scaffolding
|
[https://nvbugs/5387375] fix(scaffolding): fix scaffolding aime test in test_e2e (#6140)
|
2025-07-18 10:34:37 +08:00 |
|
tools
|
enh: Add script to map tests <-> jenkins stages & vice-versa (#5177)
|
2025-07-19 00:50:40 +08:00 |
|
trt
|
[TRTLLM-5863][feat] Support MoE INT8 Weight-Only-Quantization in PyTorch Workflow (#6629)
|
2025-08-15 17:15:49 -04:00 |
|
utils
|
[None][fix] Migrate to new cuda binding package name (#6700)
|
2025-08-07 16:29:55 -04:00 |
|
conftest.py
|
[TRTLLM-5508][feat] check input tokens + improve error handling (#5170)
|
2025-08-05 18:27:43 +01:00 |
|
dump_checkpoint_stats.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
gc_utils.py
|
[nvbug 5273941] fix: broken cyclic reference detect (#5417)
|
2025-07-01 20:12:55 +08:00 |
|
profile_utils.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
pytest.ini
|
[ci] parallelize torch unittests (#5714)
|
2025-07-09 11:05:57 +03:00 |
|
test_model_runner_cpp.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
test_pip_install.py
|
[TRTLLM-7141][infra] Use repo mirrors to avoid intermittent network failures (#6836)
|
2025-08-15 11:16:07 +08:00 |