This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-29 15:15:08 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
49dcc0df53
TensorRT-LLMs
/
cpp
/
tests
/
unit_tests
/
kernels
History
ChristinaZ
c5fb692a7d
Refactor the rest routing part for the routing kernels in the MoE TRT-LLM backend (
#5771
)
...
Signed-off-by: Christina Zhang <83400082+ChristinaZ@users.noreply.github.com>
2025-07-11 16:37:56 +08:00
..
allReduce
cudaCoreGemm
fused_gated_gemm
routing
Refactor the rest routing part for the routing kernels in the MoE TRT-LLM backend (
#5771
)
2025-07-11 16:37:56 +08:00
sampling
smoothQuant
weightOnly
banRepeatNGramsKernelsTest.cpp
CMakeLists.txt
decodingKernelTest.cpp
logitsBitmaskTest.cpp
mixtureOfExpertsTest.cu
mlaChunkedPrefillTest.cu
mlaPreprocessTest.cu
ropeTest.cu
shiftKCacheKernelTest.cu
stopCriteriaKernelsTest.cpp