kanshan/TensorRT-LLMs
Mirror of https://github.com/NVIDIA/TensorRT-LLM.git (synced 2026-02-06 11:11:36 +08:00)
bdf6953ddc
TensorRT-LLMs/tests/unittest/_torch/auto_deploy/unit
Lucas Liebenwein  1bbe71b3ed  [#10244][feat] AutoDeploy: separate prefill/decode in flashinfer (#10252)
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
2025-12-31 17:01:24 -05:00
multigpu  [#9717][chore] Refactor MoE code to use enums (#9910)  2025-12-22 15:14:56 -05:00
singlegpu  [#10244][feat] AutoDeploy: separate prefill/decode in flashinfer (#10252)  2025-12-31 17:01:24 -05:00