TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-04 18:21:52 +08:00

彭晋韬(jtao peng) 211c44b951 [None][feat] Adding torch ext API for FusedAddRMSNormQuant kernel (#9905) Signed-off-by: jintaop <jintaop@nvidia.com>	2026-01-15 07:29:15 +08:00
CMakeLists.txt
fp4_converter.cuh	[None][fix] Introduce inline namespace to avoid symbol collision (#9541)	2025-12-12 23:32:15 +08:00
layernorm_param.h	[None][fix] Introduce inline namespace to avoid symbol collision (#9541)	2025-12-12 23:32:15 +08:00
low_latency_layernorm.cuh	[None][feat] Adding torch ext API for FusedAddRMSNormQuant kernel (#9905)	2026-01-15 07:29:15 +08:00
ws_layernorm_fp4_traits.cu	[None][fix] Introduce inline namespace to avoid symbol collision (#9541)	2025-12-12 23:32:15 +08:00
ws_layernorm.cuh	[None][feat] Adding torch ext API for FusedAddRMSNormQuant kernel (#9905)	2026-01-15 07:29:15 +08:00
ws_layernorm.h	[None][fix] Introduce inline namespace to avoid symbol collision (#9541)	2025-12-12 23:32:15 +08:00