TensorRT-LLMs/tests/unittest
Chenghao Zhang a6d20f6f9b
[None][feat] AutoDeploy: Add FP8 MOE for Nemotron (#8599)
Signed-off-by: Chenghao Zhang <211069071+nvchenghaoz@users.noreply.github.com>
Signed-off-by: Fridah-nv <201670829+Fridah-nv@users.noreply.github.com>
Signed-off-by: nvchenghaoz <211069071+nvchenghaoz@users.noreply.github.com>
Co-authored-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
Co-authored-by: Fridah-nv <201670829+Fridah-nv@users.noreply.github.com>
2025-10-25 15:26:45 -04:00
..
_torch [None][feat] AutoDeploy: Add FP8 MOE for Nemotron (#8599) 2025-10-25 15:26:45 -04:00
api_stability [TRTLLM-8513][feat] Add back worker extension (#8482) 2025-10-24 20:30:28 -04:00
bindings [None][infra] Skip failed cases for main branch (#8293) 2025-10-12 08:04:09 -07:00
disaggregated [TRTLLM-7843][feat] implement disagg cluster auto-scaling (#8215) 2025-10-21 17:25:07 -04:00
executor [https://nvbugs/5494718][fix] Fix Single GPU Multi-node issue and OOM on DGX Spark (#8514) 2025-10-24 19:09:07 -07:00
llmapi [TRTLLM-8737][feat] Support media_io_kwargs on trtllm-serve (#8528) 2025-10-24 12:53:40 -04:00
others [TRTLLM-8812][chore] Limit the scope of pybind based CacheTransceiverConfig (#8558) 2025-10-23 10:32:09 -04:00
scaffolding [https://nvbugs/5387375] fix(scaffolding): fix scaffolding aime test in test_e2e (#6140) 2025-07-18 10:34:37 +08:00
tools [None][infra] Waive failed cases for main on 0929 (#8053) 2025-09-29 02:46:02 -04:00
trt [TRTLLM-8682][chore] Remove auto_parallel module (#8329) 2025-10-22 20:53:08 -04:00
utils [TRTLLM-8507][fix] Fix ray resource cleanup and error handling in LoRA test (#8175) 2025-10-14 23:46:30 +08:00
conftest.py [TRTLLM-4866] [test] Support waiving unit tests by waives.txt (#8359) 2025-10-20 09:52:51 +08:00
dump_checkpoint_stats.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
gc_utils.py [nvbug 5273941] fix: broken cyclic reference detect (#5417) 2025-07-01 20:12:55 +08:00
profile_utils.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
pytest.ini [TRTLLM-8189][chore] enhance GenerationExecutor with RPC (part1) (#5543) 2025-10-05 17:28:20 +08:00
test_model_runner_cpp.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
test_pip_install.py [None] [chore] Add architecture-specific ATTRIBUTIONS files (#8468) 2025-10-20 16:29:15 -04:00