TensorRT-LLMs/tensorrt_llm/runtime
2024-11-05 10:17:16 +00:00
__init__.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
enc_dec_model_runner.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
generation.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
kv_cache_manager.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
medusa_utils.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
model_runner_cpp.py open source v0.12-jetson 2024-11-05 10:17:16 +00:00
model_runner.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
multimodal_model_runner.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
redrafter_utils.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
session.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00