TensorRT-LLMs/cpp/tensorrt_llm/kernels/fusedLayernormKernels
Yibin Li 32ae1564bd
update FP4 quantize layout (#3045)
Signed-off-by: Yibin Li <109242046+yibinl-nvidia@users.noreply.github.com>
2025-04-03 13:13:54 -04:00
File                        Last commit                         Date
CMakeLists.txt              Update TensorRT-LLM (#2820)         2025-02-25 21:21:49 +08:00
fp4_converter.cuh           update FP4 quantize layout (#3045)  2025-04-03 13:13:54 -04:00
layernorm_param.h           Update TensorRT-LLM (#2783)         2025-02-13 18:40:22 +08:00
low_latency_layernorm.cuh   Update TensorRT-LLM (#2783)         2025-02-13 18:40:22 +08:00
ws_layernorm_fp4_traits.cu  Update TensorRT-LLM (#2783)         2025-02-13 18:40:22 +08:00
ws_layernorm.cuh            Update TensorRT-LLM (#2783)         2025-02-13 18:40:22 +08:00
ws_layernorm.h              Update TensorRT-LLM (#2783)         2025-02-13 18:40:22 +08:00