This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-31 00:01:22 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
c289880afb
TensorRT-LLMs
/
tests
/
integration
/
test_lists
History
Pengbo Wang @ NVIDIA
c289880afb
[None][fix] fix kimi k2 serving and add test for Kimi-K2 (
#6589
)
...
Signed-off-by: Pengbo Wang <221450789+pengbowang-nv@users.noreply.github.com>
2025-08-05 18:05:33 +08:00
..
dev
Update (
#2978
)
2025-03-23 16:39:35 +08:00
qa
[None][fix] fix kimi k2 serving and add test for Kimi-K2 (
#6589
)
2025-08-05 18:05:33 +08:00
test-db
[None][fix] Revert commit
48ddc3d
& add test for disagg server with different max_num_tokens (
#6259
)
2025-08-04 15:09:51 +08:00
waives.txt
[TRTLLM-6674][feat] (Breaking Change) Hopper SWA non-cyclic kernels + KV reuse + Spec Dec (
#6379
)
2025-08-05 07:47:41 +00:00