This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-14 06:27:45 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
49fe089470
TensorRT-LLMs
/
cpp
/
kernels
History
Jhao-Ting Chen
0a09465089
[
https://nvbugs/5567586
][feat] Ampere xqa swa specdec for GPT-OSS Eagle3-one-model (
#8383
)
...
Signed-off-by: Jhao-Ting Chen <jhaotingc@nvidia.com>
2025-12-08 11:16:05 -08:00
..
fmha_v2
[None][chore] Weekly mass integration of release/1.1 -- rebase (
#9522
)
2025-11-29 21:48:48 +08:00
xqa
[
https://nvbugs/5567586
][feat] Ampere xqa swa specdec for GPT-OSS Eagle3-one-model (
#8383
)
2025-12-08 11:16:05 -08:00