| Name | Last commit | Date |
|---|---|---|
| allReduce | [None] [feat] Add model gpt-oss (#6645) | 2025-08-07 03:04:18 -04:00 |
| cudaCoreGemm | [NVBUG-5304516/5319741]Qwen2.5VL FP8 support (#5029) | 2025-07-09 23:16:42 +08:00 |
| fused_gated_gemm | Update TensorRT-LLM (#2755) | 2025-02-11 03:01:00 +00:00 |
| routing | Refactor the rest routing part for the routing kernels in the MoE TRT-LLM backend (#5771) | 2025-07-11 16:37:56 +08:00 |
| sampling | test: Test OOB access issue in penaltyKernel for endId=-1 (#4035) | 2025-05-05 10:24:28 -07:00 |
| smoothQuant | [None] [feat] Add model gpt-oss (#6645) | 2025-08-07 03:04:18 -04:00 |
| weightOnly | [feat] Optimizations on weight-only batched gemv kernel (#5420) | 2025-06-30 10:20:16 +08:00 |
| banRepeatNGramsKernelsTest.cpp | chore: remove usernames from comments (#3291) | 2025-04-05 13:44:28 +08:00 |
| CMakeLists.txt | Fix GEMM+AR fusion on blackwell (#5563) | 2025-07-09 08:48:47 +08:00 |
| decodingKernelTest.cpp | refactor: Clean up DecodingInput and DecodingOutput (#5617) | 2025-07-01 14:31:42 +02:00 |
| logitsBitmaskTest.cpp | Update TensorRT-LLM (#2755) | 2025-02-11 03:01:00 +00:00 |
| mixtureOfExpertsTest.cu | [TRTLLM-6744][feat] Remove input_sf swizzle for module WideEPMoE (#6231) | 2025-08-08 11:13:42 +08:00 |
| mlaChunkedPrefillTest.cu | [None] [feat] Add model gpt-oss (#6645) | 2025-08-07 03:04:18 -04:00 |
| mlaPreprocessTest.cu | [feat] Optimize KV Cache Reuse for MLA (#4869) | 2025-06-13 11:03:05 +08:00 |
| ropeTest.cu | feat: Add FP8 support for SM 120 (#3248) | 2025-04-14 16:05:41 -07:00 |
| shiftKCacheKernelTest.cu | Update TensorRT-LLM (#2755) | 2025-02-11 03:01:00 +00:00 |
| stopCriteriaKernelsTest.cpp | chore: remove usernames from comments (#3291) | 2025-04-05 13:44:28 +08:00 |