TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-25 05:02:59 +08:00

History

rakib-hasan ff3b741045 feat: adding multimodal (only image for now) support in trtllm-bench (#3490 ) * feat: adding multimodal (only image for now) support in trtllm-bench Signed-off-by: Rakib Hasan <rhasan@nvidia.com> * fix: add in load_dataset() calls to maintain the v2.19.2 behavior Signed-off-by: Rakib Hasan <rhasan@nvidia.com> * re-adding prompt_token_ids and using that for prompt_len Signed-off-by: Rakib Hasan <rhasan@nvidia.com> * updating the datasets version in examples as well Signed-off-by: Rakib Hasan <rhasan@nvidia.com> * api changes are not needed Signed-off-by: Rakib Hasan <rhasan@nvidia.com> * moving datasets requirement and removing a missed api change Signed-off-by: Rakib Hasan <rhasan@nvidia.com> * addressing review comments Signed-off-by: Rakib Hasan <rhasan@nvidia.com> * refactoring the quickstart example Signed-off-by: Rakib Hasan <rhasan@nvidia.com> --------- Signed-off-by: Rakib Hasan <rhasan@nvidia.com>		2025-04-18 07:06:16 +08:00
..
apps	doc: add genai-perf benchmark & slurm multi-node for trtllm-serve doc (#3407 )	2025-04-16 00:11:58 +08:00
__init__.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
_run_mpi_comm_task.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
fake.sh	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
grid_searcher.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
run_llm_exit.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
run_llm_with_postproc.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
run_llm.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_build_cache.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_executor.py	Add thread leak check and fix thread/memory leak issues. (#3270 )	2025-04-08 19:03:18 +08:00
test_llm_args.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_download.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_kv_cache_events.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_models.py	chore: clean some ci of qa test (#3083 )	2025-03-31 14:30:41 +08:00
test_llm_multi_gpu.py	tests: waive test_llm_multi_node (#3664 )	2025-04-18 01:59:16 +08:00
test_llm_perf_evaluator.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_quant.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_utils.py	chore: refactor the LlmArgs with Pydantic and migrate remaining pybinding configs to python (#3025 )	2025-04-05 13:31:48 +08:00
test_llm.py	feat: adding multimodal (only image for now) support in trtllm-bench (#3490 )	2025-04-18 07:06:16 +08:00
test_mpi_session.py	Update (#2978 )	2025-03-23 16:39:35 +08:00