This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-12 22:14:03 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
87e0c8a749
TensorRT-LLMs
/
tests
/
unittest
/
_torch
/
auto_deploy
/
unit
History
gramnarayan
098b9ff226
[
#9147
][feat] AutoDeploy: Draft Target Speculative Decoding (
#9275
)
...
Signed-off-by: Govind Ramnarayan <105831528+govind-ramnarayan@users.noreply.github.com>
2025-12-04 05:13:49 +08:00
..
multigpu
[TRTLLM-8946][feat] Improved heuristics to detect shardable regions (
#9200
)
2025-12-02 22:08:19 +01:00
singlegpu
[
#9147
][feat] AutoDeploy: Draft Target Speculative Decoding (
#9275
)
2025-12-04 05:13:49 +08:00