| .. |
|
_torch
|
[OMNIML-2336][feat] w4a8 nvfp4 fp8 exports scale factor properly (#8180)
|
2025-10-15 13:41:27 +08:00 |
|
api_stability
|
[None][chore] set the default value of max_num_tokens explicitly (#8208)
|
2025-10-14 23:03:02 -07:00 |
|
bindings
|
[None][infra] Skip failed cases for main branch (#8293)
|
2025-10-12 08:04:09 -07:00 |
|
disaggregated
|
[TRTLLM-7846][feat] implement etcd storage for disagg cluster (#8210)
|
2025-10-14 16:48:41 -04:00 |
|
executor
|
[None][ci] waive several rpc tests (#8349)
|
2025-10-14 03:12:49 -07:00 |
|
llmapi
|
[TRTLLM-8507][fix] Fix ray resource cleanup and error handling in LoRA test (#8175)
|
2025-10-14 23:46:30 +08:00 |
|
others
|
[None][feat] Add request timing breakdown option in benchmark_serving (#8128)
|
2025-10-10 09:24:54 -07:00 |
|
scaffolding
|
[https://nvbugs/5387375] fix(scaffolding): fix scaffolding aime test in test_e2e (#6140)
|
2025-07-18 10:34:37 +08:00 |
|
tools
|
[None][infra] Waive failed cases for main on 0929 (#8053)
|
2025-09-29 02:46:02 -04:00 |
|
trt
|
[None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851)
|
2025-09-25 21:02:35 +08:00 |
|
utils
|
[TRTLLM-8507][fix] Fix ray resource cleanup and error handling in LoRA test (#8175)
|
2025-10-14 23:46:30 +08:00 |
|
conftest.py
|
[TRTLLM-8414][chore] BREAKING CHANGE: refine sampling strategy selection (#8132)
|
2025-10-08 15:46:50 +02:00 |
|
dump_checkpoint_stats.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
gc_utils.py
|
[nvbug 5273941] fix: broken cyclic reference detect (#5417)
|
2025-07-01 20:12:55 +08:00 |
|
profile_utils.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
pytest.ini
|
[TRTLLM-8189][chore] enhance GenerationExecutor with RPC (part1) (#5543)
|
2025-10-05 17:28:20 +08:00 |
|
test_model_runner_cpp.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
test_pip_install.py
|
[TRTLLM-7989][infra] Bundle UCX and NIXL libs in the TRTLLM python package (#7766)
|
2025-09-22 16:43:35 +08:00 |