TensorRT-LLMs/cpp/tensorrt_llm/kernels/fusedLayernormKernels
Wanli Jiang 421eb9e39c
[None][feat] Optimize NemotronH model with elementwise and nvfp4 fusion (#11273)
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
2026-02-12 09:25:31 -05:00
..
CMakeLists.txt Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00
fp4_converter.cuh [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
layernorm_param.h [None][feat] Optimize NemotronH model with elementwise and nvfp4 fusion (#11273) 2026-02-12 09:25:31 -05:00
low_latency_layernorm.cuh [None][feat] Optimize NemotronH model with elementwise and nvfp4 fusion (#11273) 2026-02-12 09:25:31 -05:00
ws_layernorm_fp4_traits.cu [None][feat] Optimize NemotronH model with elementwise and nvfp4 fusion (#11273) 2026-02-12 09:25:31 -05:00
ws_layernorm.cuh [None][feat] Optimize NemotronH model with elementwise and nvfp4 fusion (#11273) 2026-02-12 09:25:31 -05:00
ws_layernorm.h [None][feat] Optimize NemotronH model with elementwise and nvfp4 fusion (#11273) 2026-02-12 09:25:31 -05:00