TensorRT-LLMs/tensorrt_llm/_torch/modules/fla
Tailing Yuan 648196f8ae
[TRTLLM-9432][feat] Reduce synchronization and recompilation for qwen3-next (#9691)
Signed-off-by: Tailing Yuan <yuantailing@gmail.com>
2025-12-23 10:14:29 +08:00
__init__.py
chunk_delta_h.py
chunk_o.py
chunk_scaled_dot_kkt.py
chunk.py
cumsum.py
fused_recurrent.py
fused_sigmoid_gating_recurrent.py
index.py
l2norm.py: [TRTLLM-9432][feat] Reduce synchronization and recompilation for qwen3-next (#9691), 2025-12-23 10:14:29 +08:00
layernorm_gated.py
op.py
solve_tril.py
utils.py
wy_fast.py