This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-13 22:18:36 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
a2f271c8e0
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
pybind
/
runtime
History
Yuan Tong
a2f271c8e0
[TRTLLM-4406][feat] LLM sleep & wakeup Part 1: virtual device memory (
#5034
)
...
Signed-off-by: Yuan Tong <13075180+tongyuantongyu@users.noreply.github.com>
2025-08-04 13:51:01 +08:00
..
bindings.cpp
[TRTLLM-4406][feat] LLM sleep & wakeup Part 1: virtual device memory (
#5034
)
2025-08-04 13:51:01 +08:00
bindings.h
[TRTLLM-4987][feat] Partial support of context logits in TRTLLMSampler (
#4538
)
2025-06-01 03:32:43 +08:00
moeBindings.cpp
feat: large-scale EP(part 8: Online EP load balancer integration for PCIe fp8) (
#5226
)
2025-06-25 22:25:13 -07:00
moeBindings.h
feat: large-scale EP(part 2: MoE Load Balancer - core utilities) (
#4384
)
2025-05-20 17:53:48 +08:00