This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-14 15:03:48 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
9d6e87aed3
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
pybind
/
runtime
History
Shiyu Li
9d6e87aed3
[None][fix] Cherry-Pick MNNVLAllreduce Fixes into release/1.1.0rc2 branch (
#7487
)
...
Signed-off-by: Shiyu Li <shili@nvidia.com>
2025-09-05 12:08:36 +08:00
..
bindings.cpp
[None][fix] Cherry-Pick MNNVLAllreduce Fixes into release/1.1.0rc2 branch (
#7487
)
2025-09-05 12:08:36 +08:00
bindings.h
[TRTLLM-4987][feat] Partial support of context logits in TRTLLMSampler (
#4538
)
2025-06-01 03:32:43 +08:00
moeBindings.cpp
feat: large-scale EP(part 8: Online EP load balancer integration for PCIe fp8) (
#5226
)
2025-06-25 22:25:13 -07:00
moeBindings.h
feat: large-scale EP(part 2: MoE Load Balancer - core utilities) (
#4384
)
2025-05-20 17:53:48 +08:00