TensorRT-LLMs/tensorrt_llm/runtime
2025-04-02 17:01:16 +08:00
memory_pools                TensorRT-LLM v0.16 Release                                    2024-12-24 15:58:43 +08:00
processor_wrapper           open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725)  2025-02-11 02:21:51 +00:00
__init__.py                 TensorRT-LLM v0.12 Update (#2164)                             2024-08-29 17:25:07 +08:00
enc_dec_model_runner.py     TensorRT-LLM Release 0.15.0 (#2529)                           2024-12-04 13:44:56 +08:00
generation.py               open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725)  2025-02-11 02:21:51 +00:00
kv_cache_manager.py         Update TensorRT-LLM v0.14.0 (#2401)                           2024-11-01 19:48:44 +08:00
medusa_utils.py             open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725)  2025-02-11 02:21:51 +00:00
model_runner_cpp.py         open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725)  2025-02-11 02:21:51 +00:00
model_runner.py             TensorRT-LLM v0.16 Release                                    2024-12-24 15:58:43 +08:00
multimodal_model_runner.py  TensorRT-LLM v0.18 release (#3231)                            2025-04-02 17:01:16 +08:00
redrafter_utils.py          open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725)  2025-02-11 02:21:51 +00:00
session.py                  open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725)  2025-02-11 02:21:51 +00:00