TensorRT-LLMs/tensorrt_llm/executor
Enwei Zhu 3fe4a1842a
fix: Register MoeLoadBalancerConfig to serialization.py (#4864)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-06-03 19:22:36 +08:00
..
__init__.py chore: rename ExecutorBindingsWorker/Proxy (#4716) 2025-05-29 10:32:35 +08:00
executor.py chore: rename ExecutorBindingsWorker/Proxy (#4716) 2025-05-29 10:32:35 +08:00
ipc.py fix [nvbug5256044]: bench hang due to llmapi ipc (#4798) 2025-06-03 10:10:53 +08:00
postproc_worker.py fix: [nvbugs/5066257] serialization improvments (#3869) 2025-05-23 13:06:29 +08:00
proxy.py fix [nvbug5256044]: bench hang due to llmapi ipc (#4798) 2025-06-03 10:10:53 +08:00
request.py feat: Add multimodal embedding field in LlmRequest (#3855) 2025-05-01 12:23:30 +08:00
result.py [feat][TRTLLM-5018] Dis serving python runtime trt backend (#4243) 2025-05-22 22:01:06 -04:00
serialization.py fix: Register MoeLoadBalancerConfig to serialization.py (#4864) 2025-06-03 19:22:36 +08:00
utils.py opt: the perormance for dist-agg streaming generation (#4214) 2025-05-31 17:40:32 +08:00
worker.py fix: re-enable tp/pp for quickstart_advanced.py. (#4766) 2025-05-31 19:13:46 +08:00