TensorRT-LLMs/cpp/tensorrt_llm/pybind/runtime
Leslie Fang 31d04dfa12
[TRTLLM-9108][feat] Add test configurable moe module multi gpu (#10699)
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
2026-01-23 10:16:58 +08:00
bindings.cpp [https://nvbugs/5782112][fix] Fix hanging issue for MNNVL Allreduce under PP (#10633) 2026-01-16 13:03:36 +08:00
bindings.h [TRTLLM-4987][feat] Partial support of context logits in TRTLLMSampler (#4538) 2025-06-01 03:32:43 +08:00
hostfunc.cpp [https://nvbugs/5643631][fix] Fix hostfunc seg fault (#10028) 2025-12-20 07:58:43 -05:00
hostfunc.h [TRTLLM-7028][feat] Enable guided decoding with speculative decoding (part 2: one-model engine) (#6948) 2025-09-03 15:16:11 -07:00
moeBindings.cpp [TRTLLM-9108][feat] Add test configurable moe module multi gpu (#10699) 2026-01-23 10:16:58 +08:00
moeBindings.h feat: large-scale EP(part 2: MoE Load Balancer - core utilities) (#4384) 2025-05-20 17:53:48 +08:00