TensorRT-LLMs/tests/unittest/_torch/thop/parallel
Rundong Li f1b85fea4c
[None][feat] Integrate cuda.tile RMS norm kernels (#9725)
Signed-off-by: Rundong (David) Li <davidli@nvidia.com>
Co-authored-by: Jinman Xie <jinmanx@nvidia.com>
Co-authored-by: Alexey Bylinkin <abylinkin@nvidia.com>
Co-authored-by: Qiqi Xiao <qiqix@nvidia.com>
Co-authored-by: Biao Wang <biaow@nvidia.com>
Co-authored-by: Thomas Schmid <thschmid@nvidia.com>
2026-02-02 19:44:27 +08:00
..
deep_gemm_tests.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_causal_conv1d_op.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_cublas_mm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_cuda_tile_custom_ops.py [None][feat] Integrate cuda.tile RMS norm kernels (#9725) 2026-02-02 19:44:27 +08:00
test_custom_ops.py [TRTLLM-9390][chore] Add Fake OPs for One-Sided AlltoAll. (#11002) 2026-01-27 15:55:07 +08:00
test_cute_dsl_moe.py [None][fix] Fix CuteDSL MoE unittest (#10983) 2026-01-26 08:34:17 +08:00
test_dsv3_fused_a_gemm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_dsv3_router_gemm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_finegrained_mixed_dtype_gemm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_fp4_bmm_quantize.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_fp4_calculate_global_scale.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_fp4_gemm_quantize.py [OMNIML-2336][feat] Add NVFP4 x FP8 (#6809) 2025-09-04 09:03:38 -07:00
test_fp4_linear.py [None][chore] Enable tvm_ffi for cute dsl nvfp4_gemm to reduce host overhead. (#9690) 2025-12-08 13:28:11 +08:00
test_fp4_swizzle.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_fp8_block_scale_gemm.py [None][feat] Drop non-deepgemm fp8 block scale gemm (#10256) 2025-12-25 14:52:52 +08:00
test_fp8_linear.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_fp8_per_tensor_scale_tllmg_gemm.py [TRTLLM-4629] [feat] Add support of CUDA13 and sm103 devices (#7568) 2025-09-16 09:56:18 +08:00
test_fp8_quantize.py [None][perf] Use fp8 quant kernel in DS3.2 indexer module (#8701) 2025-10-29 12:45:09 +08:00
test_fp8_rowwise_linear.py [None][infra] Remove invaild waived tests which not in release branch (#8841) 2025-11-20 12:43:13 -05:00
test_fused_qk_norm_rope.py [None][feat] Fused kernels (qknormrope + moe routing) and two-model MTP support for glm4moe (#9852) 2025-12-14 10:47:24 +08:00
test_helix_postprocess.py [TRTLLM-9493][feat] Add helixPostProcessNative kernel for cp_dim=2 (#9924) 2025-12-12 16:49:25 -08:00
test_indexer_topk.py [https://nvbugs/5720357][fix] Fix indice offset overflow in custom Top-K kernel and corresponding UT case (#10027) 2025-12-19 14:58:01 -05:00
test_logits_bitmask_op.py [TRTLLM-8209][feat] Support new structural tag API (upgrade XGrammar to 0.1.25) (#7893) 2025-09-23 09:10:09 +08:00
test_mamba2_chunk_ss_update.py [TRTLLM-10062][feat] Enable MTP for Nemotron Super (#10754) 2026-01-26 11:23:26 -05:00
test_mamba_conv1d_op.py [https://nvbugs/5640873][fix] Move thop tests to pre-merge (#9094) 2025-11-13 13:08:13 +08:00
test_noaux_tc.py [None][feat] Add routing support for the new model for both cutlass and trtllm moe backend (#9792) 2025-12-15 19:59:08 -08:00
test_scaled_mm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_selective_scan_op.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_tinygemm2.py [TRTLLM-7775][feat] Integrate tinygemm2 for gpt-oss (#7916) 2025-10-02 10:47:04 -07:00
test_tllmg_bmm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_w4a8_linear.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_w4a8_mxfp4_mxfp8_gemm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_w4a16_linear.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_weight_only_quant_gemm.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00
test_weight_only_quant_linear.py [TRTLLM-7457][ci] Update unittest parallel config (#7297) 2025-08-29 09:28:04 +08:00