TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-29 15:15:08 +08:00

History

Yechan Kim 5460d18b10 feat: trtllm-serve multimodal support (#3590 ) * feat: trtllm-serve multimodal support Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com> * remove disable argument Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com> * remove disable Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com> * add and separate tests and move the doc Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com> * remove block_resue arg from serve.py Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com> --------- Signed-off-by: yechank <161688079+yechank-nvidia@users.noreply.github.com> Co-authored-by: Haohang Huang <31998628+symphonylyh@users.noreply.github.com>		2025-04-19 05:01:28 +08:00
..
apps	feat: trtllm-serve multimodal support (#3590 )	2025-04-19 05:01:28 +08:00
__init__.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
_run_mpi_comm_task.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
fake.sh	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
grid_searcher.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
run_llm_exit.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
run_llm_with_postproc.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
run_llm.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_build_cache.py	Update TensorRT-LLM (#2936 )	2025-03-18 21:25:19 +08:00
test_executor.py	Add thread leak check and fix thread/memory leak issues. (#3270 )	2025-04-08 19:03:18 +08:00
test_llm_args.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_download.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_kv_cache_events.py	test: add kv cache event tests for disagg workers (#3602 )	2025-04-18 18:30:19 +08:00
test_llm_models.py	chore: clean some ci of qa test (#3083 )	2025-03-31 14:30:41 +08:00
test_llm_multi_gpu.py	fix hmac in remote mpi session (#3649 )	2025-04-18 17:47:51 +08:00
test_llm_perf_evaluator.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_quant.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_llm_utils.py	chore: refactor the LlmArgs with Pydantic and migrate remaining pybinding configs to python (#3025 )	2025-04-05 13:31:48 +08:00
test_llm.py	feat: adding multimodal (only image for now) support in trtllm-bench (#3490 )	2025-04-18 07:06:16 +08:00
test_mpi_session.py	Update (#2978 )	2025-03-23 16:39:35 +08:00