kanshan/TensorRT-LLMs
Mirror of https://github.com/NVIDIA/TensorRT-LLM.git, synced 2026-01-14 06:27:45 +08:00
85b4ae26b7
TensorRT-LLMs/cpp/kernels/xqa/test
Latest commit: 19b7524ff6 by Ransiki, 2025-08-06 09:29:37 +08:00
[None][feat] Add vLLM KV Pool support for XQA kernel (#6013)
Signed-off-by: Ransiki Zhang <ransikiz@nvidia.com>
File | Last commit | Date
refAttention.cpp | [TRTLLM-6674][feat] (Breaking Change) Hopper SWA non-cyclic kernels + KV reuse + Spec Dec (#6379) | 2025-08-05 07:47:41 +00:00
refAttention.h | [None][feat] Add vLLM KV Pool support for XQA kernel (#6013) | 2025-08-06 09:29:37 +08:00
test.cpp | [None][feat] Add vLLM KV Pool support for XQA kernel (#6013) | 2025-08-06 09:29:37 +08:00
warmup.cu | [feat] Support XQA-based MLA on SM120 (#4858) | 2025-06-06 22:32:49 +08:00