kanshan/TensorRT-LLMs
Mirror of https://github.com/NVIDIA/TensorRT-LLM.git (synced 2026-02-12 14:03:48 +08:00)
TensorRT-LLMs/cpp/tensorrt_llm/kernels/tinygemm2 (at 910c070e88)
Last commit: 6c694f85ba by Bo Deng, 2026-01-26 11:02:22 -05:00
[None][fix] fix TinyGemm accuracy issue. cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/10619 and https://github.com/NVIDIA/TensorRT-LLM/pull/10873 (#10990)
Signed-off-by: Bo Deng <deemod@nvidia.com>
File | Last commit | Date
CMakeLists.txt | [TRTLLM-7775][feat] Integrate tinygemm2 for gpt-oss (#7916) | 2025-10-02 10:47:04 -07:00
tinygemm2_cuda.cu | [TRTLLM-7775][feat] Integrate tinygemm2 for gpt-oss (#7916) | 2025-10-02 10:47:04 -07:00
tinygemm2_kernel.cuh | [None][fix] fix TinyGemm accuracy issue. cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/10619 and https://github.com/NVIDIA/TensorRT-LLM/pull/10873 (#10990) | 2026-01-26 11:02:22 -05:00