|
allReduce
|
TensorRT-LLM v0.11 Update (#1969)
|
2024-07-17 20:45:02 +08:00 |
|
cudaCoreGemm
|
TensorRT-LLM v0.12 Update (#2164)
|
2024-08-29 17:25:07 +08:00 |
|
fused_gated_gemm
|
TensorRT-LLM v0.11 Update (#1969)
|
2024-07-17 20:45:02 +08:00 |
|
sampling
|
TensorRT-LLM v0.12 Update (#2164)
|
2024-08-29 17:25:07 +08:00 |
|
smoothQuant
|
TensorRT-LLM v0.10 update
|
2024-06-05 20:43:25 +08:00 |
|
weightOnly
|
TensorRT-LLM v0.11 Update (#1969)
|
2024-07-17 20:45:02 +08:00 |
|
banRepeatNGramsKernelsTest.cpp
|
TensorRT-LLM v0.10 update
|
2024-06-05 20:43:25 +08:00 |
|
ropeTest.cu
|
TensorRT-LLM v0.12 Update (#2164)
|
2024-08-29 17:25:07 +08:00 |
|
stopCriteriaKernelsTest.cpp
|
TensorRT-LLM v0.12 Update (#2164)
|
2024-08-29 17:25:07 +08:00 |