This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-14 06:27:45 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
918fedf952
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
pybind
/
runtime
History
Robin Kobus
918fedf952
[None][refactor] Simplify finish reasons handling in DecoderState (
#6524
)
...
Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
2025-08-02 07:17:43 +02:00
..
bindings.cpp
[None][refactor] Simplify finish reasons handling in DecoderState (
#6524
)
2025-08-02 07:17:43 +02:00
bindings.h
[TRTLLM-4987][feat] Partial support of context logits in TRTLLMSampler (
#4538
)
2025-06-01 03:32:43 +08:00
moeBindings.cpp
feat: large-scale EP(part 8: Online EP load balancer integration for PCIe fp8) (
#5226
)
2025-06-25 22:25:13 -07:00
moeBindings.h
feat: large-scale EP(part 2: MoE Load Balancer - core utilities) (
#4384
)
2025-05-20 17:53:48 +08:00