TensorRT-LLMs/cpp/tests/unit_tests/kernels
Yibin Li 32ae1564bd
update FP4 quantize layout (#3045)
Signed-off-by: Yibin Li <109242046+yibinl-nvidia@users.noreply.github.com>
2025-04-03 13:13:54 -04:00
..
allReduce update FP4 quantize layout (#3045) 2025-04-03 13:13:54 -04:00
cudaCoreGemm Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
fused_gated_gemm Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
sampling Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
smoothQuant Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
weightOnly Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
banRepeatNGramsKernelsTest.cpp Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
CMakeLists.txt Update (#2978) 2025-03-23 16:39:35 +08:00
decodingKernelTest.cpp refactor: Improve decoder finalize function (#3077) 2025-03-28 14:33:59 +08:00
logitsBitmaskTest.cpp Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
mixtureOfExpertsTest.cu update FP4 quantize layout (#3045) 2025-04-03 13:13:54 -04:00
ropeTest.cu Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
shiftKCacheKernelTest.cu Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
stopCriteriaKernelsTest.cpp Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00