TensorRT-LLMs/tensorrt_llm/bench/dataclasses
Zongfei Jing 1e5af736ea
Add smart router for moe (#3641)
Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
2025-04-23 12:21:59 +08:00
__init__.py Update TensorRT-LLM (#2502) 2024-11-26 16:51:34 +08:00
configuration.py Add smart router for moe (#3641) 2025-04-23 12:21:59 +08:00
engine.py Update TensorRT-LLM (#2502) 2024-11-26 16:51:34 +08:00
enums.py Update TensorRT-LLM (#2502) 2024-11-26 16:51:34 +08:00
general.py feat: adding multimodal (only image for now) support in trtllm-bench (#3490) 2025-04-18 07:06:16 +08:00
reporting.py fix: Correct reporting of text dtype for Llama 4 (#3494) 2025-04-18 00:07:49 +08:00
statistics.py perf: Add total token throughput metric. (#3212) 2025-04-05 13:17:59 +08:00