TensorRT-LLMs/cpp/tests/kernels
2024-08-29 17:25:07 +08:00
..
allReduce TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
cudaCoreGemm TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
fused_gated_gemm TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
sampling TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
smoothQuant TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
weightOnly TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
banRepeatNGramsKernelsTest.cpp TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
decodingKernelTest.cpp TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
mixtureOfExpertsTest.cu TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
ropeTest.cu TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
shiftKCacheKernelTest.cu TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
stopCriteriaKernelsTest.cpp TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00