TensorRT-LLMs/tests/unittest/_torch
Zongfei Jing c76b428e2e
[TRTLLM-9685] [feat] Add gather fc1 kernel by cuteDSL (#9618)
Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
2025-12-11 16:21:32 +08:00
attention [None] [feat] Optimize the algorithm part of RocketKV (#9333) 2025-12-01 09:04:09 +08:00
auto_deploy [#9753][feat] AutoDeploy: Implement add rms_norm fusion (#9754) 2025-12-08 14:24:27 -08:00
compilation [TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804) 2025-05-09 11:04:01 +08:00
debugger Fix: fix nvbug 5356427 (#5464) 2025-06-25 22:24:26 +08:00
executor [https://nvbugs/5677746][fix] Use first PP rank's schedule result in other PP ranks to fix PP hang (#9659) 2025-12-08 18:43:52 -08:00
misc [None][fix] Fully resolve the tactic recovery issues in AutoTuner serialized cache (#9835) 2025-12-10 20:41:04 +08:00
modeling [TRTLLM-7967][chore] Add more tests (#9415) 2025-12-08 11:57:32 -08:00
models/checkpoints/hf [TRTLLM-7136][feat] Update load_weights method to include mapping parameter in checkpoint loaders (#9583) 2025-12-05 16:07:20 +01:00
modules [TRTLLM-9603][feat] Enable ConfigurableMoE test in the CI (#9645) 2025-12-08 10:19:40 +08:00
multi_gpu [None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce (#9314) 2025-12-07 09:43:26 -08:00
multi_gpu_modeling [https://nvbugs/5515753][ci] Add NCCL_DEBUG=INFO flag to collect more info with CI failure. (#8440) 2025-11-20 12:43:13 -05:00
multimodal [None][fix] InputProcessor config naming convention fix (#8705) 2025-11-03 22:29:21 -08:00
ray_orchestrator [TRTLLM-9144][fix] enhance RPC robustness (#8711) 2025-12-02 21:37:59 +08:00
sampler [TRTLLM-6756][feat] Update BeamSearch for TorchSampler (#9660) 2025-12-09 10:44:01 +01:00
speculative [None][feat] Make 2-model spec dec use the 1-model kernels (Hopper) (#8810) 2025-12-09 11:06:31 -05:00
thop [TRTLLM-9685] [feat] Add gather fc1 kernel by cuteDSL (#9618) 2025-12-11 16:21:32 +08:00
helpers.py [#8733][feat] Add Llama4 MoE handling to AutoDeploy (#9556) 2025-12-04 08:03:33 +02:00
pattern_watcher.py [TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804) 2025-05-09 11:04:01 +08:00
test_connector.py [None][feat] KV Cache Connector API (#7228) 2025-08-28 23:09:27 -04:00