TensorRT-LLMs/tensorrt_llm/bench/benchmark
Zongfei Jing 1e5af736ea
Add smart router for moe (#3641)
Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
2025-04-23 12:21:59 +08:00
Name | Last commit | Date
utils | Add smart router for moe (#3641) | 2025-04-23 12:21:59 +08:00
__init__.py | Update TensorRT-LLM (#2389) | 2024-10-29 22:24:38 +08:00
low_latency.py | chore: refactor the LlmArgs with Pydantic and migrate remaining pybinding configs to python (#3025) | 2025-04-05 13:31:48 +08:00
throughput.py | Add smart router for moe (#3641) | 2025-04-23 12:21:59 +08:00