TensorRT-LLMs/tests/unittest/_torch
Mike Iovine 07c76a5fac
[None][feat] Make 2-model spec dec use the 1-model kernels (Hopper) (#8810)
Signed-off-by: Mike Iovine <6158008+mikeiovine@users.noreply.github.com>
2025-12-09 11:06:31 -05:00
..
attention [None] [feat] Optimize the algorithm part of RocketKV (#9333) 2025-12-01 09:04:09 +08:00
auto_deploy [#9753][feat] AutoDeploy: Implement add rms_norm fusion (#9754) 2025-12-08 14:24:27 -08:00
compilation [TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804) 2025-05-09 11:04:01 +08:00
debugger Fix: fix nvbug 5356427 (#5464) 2025-06-25 22:24:26 +08:00
executor [https://nvbugs/5677746][fix] Use first PP rank's schedule result in other PP ranks to fix PP hang (#9659) 2025-12-08 18:43:52 -08:00
misc [TRTLLM-9372][feat] Enable CuteDSL MoE with Large EP (#9592) 2025-12-05 22:08:52 -08:00
modeling [TRTLLM-7967][chore] Add more tests (#9415) 2025-12-08 11:57:32 -08:00
models/checkpoints/hf [TRTLLM-7136][feat] Update load_weights method to include mapping parameter in checkpoint loaders (#9583) 2025-12-05 16:07:20 +01:00
modules [TRTLLM-9603][feat] Enable ConfigurableMoE test in the CI (#9645) 2025-12-08 10:19:40 +08:00
multi_gpu [None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce (#9314) 2025-12-07 09:43:26 -08:00
multi_gpu_modeling [https://nvbugs/5515753][ci] Add NCCL_DEBUG=INFO flag to collect more info with CI failure. (#8440) 2025-11-20 12:43:13 -05:00
multimodal [None][fix] InputProcessor config naming convention fix (#8705) 2025-11-03 22:29:21 -08:00
ray_orchestrator [TRTLLM-9144][fix] enhance RPC robustness (#8711) 2025-12-02 21:37:59 +08:00
sampler [TRTLLM-6756][feat] Update BeamSearch for TorchSampler (#9660) 2025-12-09 10:44:01 +01:00
speculative [None][feat] Make 2-model spec dec use the 1-model kernels (Hopper) (#8810) 2025-12-09 11:06:31 -05:00
thop [https://nvbugs/5575841] [fix] Nvbug 5575841: Remove additional test waivers for TestMoEFP4 (#9788) 2025-12-09 13:37:55 +00:00
helpers.py [#8733][feat] Add Llama4 MoE handling to AutoDeploy (#9556) 2025-12-04 08:03:33 +02:00
pattern_watcher.py [TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804) 2025-05-09 11:04:01 +08:00
test_connector.py [None][feat] KV Cache Connector API (#7228) 2025-08-28 23:09:27 -04:00