TensorRT-LLMs/tensorrt_llm/bench
rakib-hasan d0eb47d33a
[TRTLLM-5053] Refactoring and Unifying the Multimodal input preparation (#4506)
* refactoring the multimodal input prep

Signed-off-by: Rakib Hasan <rhasan@nvidia.com>

* adding out-of-tree override option

Signed-off-by: Rakib Hasan <rhasan@nvidia.com>

* adding exceptional case for llava-next

Signed-off-by: Rakib Hasan <rhasan@nvidia.com>

* fixing typo

Signed-off-by: Rakib Hasan <rhasan@nvidia.com>

* addressing review comments, adding placement option, handling tokenizer variations

Signed-off-by: Rakib Hasan <rhasan@nvidia.com>

* addressing pytest-asyncio behavior change

Signed-off-by: Rakib Hasan <rhasan@nvidia.com>

---------

Signed-off-by: Rakib Hasan <rhasan@nvidia.com>
2025-06-03 12:02:07 -07:00
..
benchmark [TRTLLM-5053] Refactoring and Unifying the Multimodal input preparation (#4506) 2025-06-03 12:02:07 -07:00
build test(perf): Add some Llama-3_3-Nemotron-Super-49B-v1 integration-perf-tests (TRT flow, trtllm-bench) (#4128) 2025-05-19 12:00:48 -07:00
dataclasses Cherry pick feat/llama4 to main (#4739) 2025-05-30 05:28:40 +08:00
utils [TRTLLM-5053] Refactoring and Unifying the Multimodal input preparation (#4506) 2025-06-03 12:02:07 -07:00
__init__.py Update TensorRT-LLM 2024-08-20 18:55:15 +08:00