TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-05 18:51:38 +08:00

Tailing Yuan 648196f8ae [TRTLLM-9432][feat] Reduce synchronization and recompilation for qwen3-next (#9691) Signed-off-by: Tailing Yuan <yuantailing@gmail.com>	2025-12-23 10:14:29 +08:00
__init__.py
chunk_delta_h.py
chunk_o.py
chunk_scaled_dot_kkt.py
chunk.py
cumsum.py
fused_recurrent.py
fused_sigmoid_gating_recurrent.py
index.py
l2norm.py	[TRTLLM-9432][feat] Reduce synchronization and recompilation for qwen3-next (#9691)	2025-12-23 10:14:29 +08:00
layernorm_gated.py
op.py
solve_tril.py
utils.py
wy_fast.py