TensorRT-LLMs/tests/unittest/_torch/modules
Anthony Chang 2198587b35
[https://nvbugs/5378031] [feat] Hopper W4A8 MoE supports ModelOpt ckpt for PyT backend (#6200)
Signed-off-by: Anthony Chang <27950904+rosenrodt@users.noreply.github.com>
2025-08-13 21:24:40 +08:00
..
tests_lora_modules added loraOp into lora layer + test for mlp and comparison to lora plugin (#3455) 2025-04-17 12:48:27 +08:00
test_fused_moe.py [https://nvbugs/5378031] [feat] Hopper W4A8 MoE supports ModelOpt ckpt for PyT backend (#6200) 2025-08-13 21:24:40 +08:00
test_moe_host_sharer.py feat: large-scale EP(part 6: Online EP load balancer integration for GB200 nvfp4) (#4818) 2025-06-08 10:25:18 +08:00
test_moe_load_balancer.py [None][perf] Improve the performance of online EPLB on Hopper by better overlapping (#6624) 2025-08-12 09:25:13 +08:00
test_moe_routing.py [None] [feat] Add model gpt-oss (#6645) 2025-08-07 03:04:18 -04:00
test_triton_linear.py [None] [feat] Add model gpt-oss (#6645) 2025-08-07 03:04:18 -04:00