TensorRT-LLMs/tensorrt_llm/_torch/modules/mamba
Tailing Yuan 648196f8ae
[TRTLLM-9432][feat] Reduce synchronization and recompilation for qwen3-next (#9691)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
2025-12-23 10:14:29 +08:00
__init__.py [TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494) 2025-06-01 13:56:44 +03:00
causal_conv1d.py [TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494) 2025-06-01 13:56:44 +03:00
layernorm_gated.py [TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494) 2025-06-01 13:56:44 +03:00
mamba2_metadata.py [TRTLLM-9432][feat] Reduce synchronization and recompilation for qwen3-next (#9691) 2025-12-23 10:14:29 +08:00
mamba2_mixer.py [None][fix] enable NvFP4/FP8 quantization for Nemotron-H architecture (#7589) 2025-09-09 11:42:22 +03:00
selective_state_update.py [TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494) 2025-06-01 13:56:44 +03:00
softplus.py [TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494) 2025-06-01 13:56:44 +03:00
ssd_bmm.py [TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494) 2025-06-01 13:56:44 +03:00
ssd_chunk_scan.py [TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H (#6334) 2025-08-22 12:15:20 -04:00
ssd_chunk_state.py [TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494) 2025-06-01 13:56:44 +03:00
ssd_combined.py [TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H (#6334) 2025-08-22 12:15:20 -04:00
ssd_state_passing.py [TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H (#6334) 2025-08-22 12:15:20 -04:00