..
_torch
[None][feat] Apply AutoTuner to fp8_block_scale_deep_gemm to trigger JIT ahead of time. ( #7113 )
2025-08-25 10:48:31 +08:00
api_stability
[TRTLLM-7326][feat] Add standalone multimodal encoder ( #6743 )
2025-08-19 21:42:50 -07:00
bindings
[TRTLLM-6881][feat] Include attention dp rank info with KV cache events ( #6563 )
2025-08-07 14:17:07 +02:00
disaggregated
feat: Dynamically remove servers in PD ( #5270 )
2025-06-25 09:50:04 +08:00
llmapi
[None][chore] Mass integration of release/1.0 ( #6864 )
2025-08-22 09:25:15 +08:00
others
[TRTLLM-7096][infra] Testing cache transmission functionality in Python ( #7025 )
2025-08-25 09:47:39 +08:00
scaffolding
[ https://nvbugs/5387375 ] fix(scaffolding): fix scaffolding aime test in test_e2e ( #6140 )
2025-07-18 10:34:37 +08:00
tools
enh: Add script to map tests <-> jenkins stages & vice-versa ( #5177 )
2025-07-19 00:50:40 +08:00
trt
[TRTLLM-5863][feat] Support MoE INT8 Weight-Only-Quantization in PyTorch Workflow ( #6629 )
2025-08-15 17:15:49 -04:00
utils
[TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H ( #6334 )
2025-08-22 12:15:20 -04:00
conftest.py
[TRTLLM-5508][feat] check input tokens + improve error handling ( #5170 )
2025-08-05 18:27:43 +01:00
dump_checkpoint_stats.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
gc_utils.py
[nvbug 5273941] fix: broken cyclic reference detect ( #5417 )
2025-07-01 20:12:55 +08:00
profile_utils.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
pytest.ini
[ci] parallelize torch unittests ( #5714 )
2025-07-09 11:05:57 +03:00
test_model_runner_cpp.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
test_pip_install.py
[TRTLLM-7141][infra] Use repo mirrors to avoid intermittent network failures ( #6836 )
2025-08-15 11:16:07 +08:00