TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Wanli Jiang	14554ab3f3	[None][feat] Support multi-gpu running for nemotron-v3-nano and super (#10118)	2025-12-26 11:23:14 +08:00
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
__init__.py	[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494)	2025-06-01 13:56:44 +03:00
causal_conv1d.py	[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494)	2025-06-01 13:56:44 +03:00
layernorm_gated.py	[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494)	2025-06-01 13:56:44 +03:00
mamba2_metadata.py	[TRTLLM-9432][feat] Reduce synchronization and recompilation for qwen3-next (#9691)	2025-12-23 10:14:29 +08:00
mamba2_mixer.py	[None][feat] Support multi-gpu running for nemotron-v3-nano and super (#10118)	2025-12-26 11:23:14 +08:00
selective_state_update.py	[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494)	2025-06-01 13:56:44 +03:00
softplus.py	[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494)	2025-06-01 13:56:44 +03:00
ssd_bmm.py	[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494)	2025-06-01 13:56:44 +03:00
ssd_chunk_scan.py	[TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H (#6334)	2025-08-22 12:15:20 -04:00
ssd_chunk_state.py	[TRTLLM-4783][feat] Mamba2 kernel updates for Nemotron-H (#4494)	2025-06-01 13:56:44 +03:00
ssd_combined.py	[TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H (#6334)	2025-08-22 12:15:20 -04:00
ssd_state_passing.py	[TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H (#6334)	2025-08-22 12:15:20 -04:00