This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-14 06:27:45 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
c9dca69e1b
TensorRT-LLMs
/
examples
/
disaggregated
/
clients
History
Pengyun Lin
a15e33351d
[None][fix] Revert commit
48ddc3d
& add test for disagg server with different max_num_tokens (
#6259
)
...
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
2025-08-04 15:09:51 +08:00
..
disagg_client.py
Refactor the first token response in PD (
#4692
)
2025-06-04 09:11:23 +08:00
long_prompts.json
[None][fix] Revert commit
48ddc3d
& add test for disagg server with different max_num_tokens (
#6259
)
2025-08-04 15:09:51 +08:00
prompts.json
Update TensorRT-LLM (
#2873
)
2025-03-11 21:13:42 +08:00