TensorRT-LLMs/tests/unittest/_torch/attention
yunruis 51545560da
[TRTLLM-8803][feat] Add rope and uk-bgemm overlap for mla generation (#8495)
Signed-off-by: yunruis <205571022+yunruis@users.noreply.github.com>
2025-11-06 17:39:57 +08:00
..
sparse [TRTLLM-8768][chore] Fuse QK down_proj with indexer K + weight_proj for FP4 ckpt (#8771) 2025-11-05 07:57:09 -08:00
test_attention_mla.py [TRTLLM-8803][feat] Add rope and uk-bgemm overlap for mla generation (#8495) 2025-11-06 17:39:57 +08:00
test_attention_no_cache.py [None][ci] move unittests to sub-directories (#6635) 2025-08-20 05:42:22 -04:00
test_attention.py [None][ci] move unittests to sub-directories (#6635) 2025-08-20 05:42:22 -04:00
test_flashinfer_attention.py [None][ci] move unittests to sub-directories (#6635) 2025-08-20 05:42:22 -04:00
test_flashinfer_star_attn.py [None][ci] move unittests to sub-directories (#6635) 2025-08-20 05:42:22 -04:00
test_vanilla_attention.py [None][ci] move unittests to sub-directories (#6635) 2025-08-20 05:42:22 -04:00