..
allReduce
[None] [feat] Add model gpt-oss ( #6645 )
2025-08-07 03:04:18 -04:00
cudaCoreGemm
[NVBUG-5304516/5319741]Qwen2.5VL FP8 support ( #5029 )
2025-07-09 23:16:42 +08:00
fused_gated_gemm
Update TensorRT-LLM ( #2755 )
2025-02-11 03:01:00 +00:00
routing
[None][feat] Add single block version renormalized routing kernel ( #6756 )
2025-08-17 13:47:13 +08:00
sampling
test: Test OOB access issue in penaltyKernel for endId=-1 ( #4035 )
2025-05-05 10:24:28 -07:00
smoothQuant
[None] [feat] Add model gpt-oss ( #6645 )
2025-08-07 03:04:18 -04:00
weightOnly
[feat] Optimizations on weight-only batched gemv kernel ( #5420 )
2025-06-30 10:20:16 +08:00
banRepeatNGramsKernelsTest.cpp
chore: remove usernames from comments ( #3291 )
2025-04-05 13:44:28 +08:00
CMakeLists.txt
[TRTLLM-6743][feat] Optimize and refactor alltoall in WideEP ( #6973 )
2025-08-24 08:15:29 -04:00
decodingKernelTest.cpp
refactor: Clean up DecodingInput and DecodingOutput ( #5617 )
2025-07-01 14:31:42 +02:00
fusedMoeCommKernelTest.cpp
[TRTLLM-6743][feat] Optimize and refactor alltoall in WideEP ( #6973 )
2025-08-24 08:15:29 -04:00
logitsBitmaskTest.cpp
Update TensorRT-LLM ( #2755 )
2025-02-11 03:01:00 +00:00
mixtureOfExpertsTest.cu
[None][perf] Make finalize fusion part of the tactic selection logic ( #6915 )
2025-08-21 14:08:03 -07:00
mlaChunkedPrefillTest.cu
[None][feat] Use Separate QKV Input Layout for Context MLA ( #6538 )
2025-08-19 22:04:48 +08:00
mlaPreprocessTest.cu
[None][feat] Use Separate QKV Input Layout for Context MLA ( #6538 )
2025-08-19 22:04:48 +08:00
ropeTest.cu
feat: Add FP8 support for SM 120 ( #3248 )
2025-04-14 16:05:41 -07:00
shiftKCacheKernelTest.cu
Update TensorRT-LLM ( #2755 )
2025-02-11 03:01:00 +00:00
stopCriteriaKernelsTest.cpp
chore: remove usernames from comments ( #3291 )
2025-04-05 13:44:28 +08:00