..
_torch
[ #9626 ][feat] Add an auto-deploy transform for using cutlass FP4 MoE kernels ( #10304 )
2025-12-29 23:18:15 +02:00
api_stability
[TRTLLM-9654][feat] Support DeepSeek-V32 chat template ( #9814 )
2025-12-19 17:05:38 +08:00
bindings
[ https://nvbugs/5643631 ][fix] Fix hostfunc seg fault ( #10028 )
2025-12-20 07:58:43 -05:00
disaggregated
[TRTLLM-8920][feat] decouple disagg service from fastapi ( #8714 )
2025-12-05 10:44:16 +08:00
executor
[ https://nvbugs/5720482 ][fix] Fix test rpc streaming ( #9902 )
2025-12-13 01:14:43 -08:00
llmapi
[ https://nvbugs/5753250 ][fix] Fix undefined local variable in responses utils ( #10154 )
2025-12-28 06:59:32 +08:00
others
[None][fix] [Gemma3] Fix RoPE for local attention for Gemma3 ( #9961 )
2025-12-27 11:50:59 -08:00
scaffolding
[None][feat] Refactor scaffolding streaming feature and fix openai wo… ( #8622 )
2025-10-30 16:02:40 +08:00
tools
[TRTC-121] [feat] Add recipe selector UI to complement the recipe database ( #10125 )
2025-12-24 23:56:54 -05:00
trt
[TRTLLM-8682][chore] Remove auto_parallel module ( #8329 )
2025-10-22 20:53:08 -04:00
utils
[TRTLLM-8376][feat] top-p optimization (removes redundant softmax) ( #9411 )
2025-11-25 18:46:48 +01:00
conftest.py
[TRTLLM-9737][chore] Add rl perf reproduce script and enhance the robustness of Ray tests ( #9939 )
2025-12-24 15:27:01 +08:00
dump_checkpoint_stats.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
gc_utils.py
[nvbug 5273941] fix: broken cyclic reference detect ( #5417 )
2025-07-01 20:12:55 +08:00
profile_utils.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
pytest.ini
[TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic ( #9726 )
2025-12-16 05:16:32 -08:00
test_model_runner_cpp.py
Update TensorRT-LLM ( #2936 )
2025-03-18 21:25:19 +08:00
test_pip_install.py
[ https://nvbugs/5616189 ][fix] Make more cases use local cached models ( #8935 )
2025-11-11 03:14:05 -08:00