TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Frank 5a6cb2b985 fix: Correct reporting of text dtype for Llama 4 (#3494) Signed-off-by: Frank Di Natale <3429989+FrankD412@users.noreply.github.com>	2025-04-18 00:07:49 +08:00
__init__.py	Update TensorRT-LLM (#2502)	2024-11-26 16:51:34 +08:00
configuration.py	chore: refactor the LlmArgs with Pydantic and migrate remaining pybinding configs to python (#3025)	2025-04-05 13:31:48 +08:00
engine.py	Update TensorRT-LLM (#2502)	2024-11-26 16:51:34 +08:00
enums.py	Update TensorRT-LLM (#2502)	2024-11-26 16:51:34 +08:00
general.py	Update TensorRT-LLM (#2849)	2025-03-04 18:44:00 +08:00
reporting.py	fix: Correct reporting of text dtype for Llama 4 (#3494)	2025-04-18 00:07:49 +08:00
statistics.py	perf: Add total token throughput metric. (#3212)	2025-04-05 13:17:59 +08:00