TensorRT-LLMs/tensorrt_llm/bench/dataclasses
Frank cf15efa15e
[TRTLLM-4883][fix]: Update output speed calculation. (#3923)
* Update gen tps calculation.

Signed-off-by: Frank Di Natale <3429989+FrankD412@users.noreply.github.com>

* Add back output speed for comparison.

* Fix issue with f-string.

* Fix some spacing.

* Replace output speed with per-request genphase tput.

* Add gen TPS breakdown.

* Update some tagging.

---------

Signed-off-by: Frank Di Natale <3429989+FrankD412@users.noreply.github.com>
2025-04-29 11:04:12 +08:00
__init__.py Update TensorRT-LLM (#2502) 2024-11-26 16:51:34 +08:00
configuration.py Add smart router for moe (#3641) 2025-04-23 12:21:59 +08:00
engine.py Update TensorRT-LLM (#2502) 2024-11-26 16:51:34 +08:00
enums.py Update TensorRT-LLM (#2502) 2024-11-26 16:51:34 +08:00
general.py feat: adding multimodal (only image for now) support in trtllm-bench (#3490) 2025-04-18 07:06:16 +08:00
reporting.py [TRTLLM-4883][fix]: Update output speed calculation. (#3923) 2025-04-29 11:04:12 +08:00
statistics.py [TRTLLM-4883][fix]: Update output speed calculation. (#3923) 2025-04-29 11:04:12 +08:00