TensorRT-LLMs/tensorrt_llm
2024-11-01 19:48:44 +08:00
..
auto_parallel Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
bench Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
commands Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
hlapi Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
layers Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
models Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
plugin Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
quantization Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
runtime Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
tools Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
__init__.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
_common.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
_ipc_utils.py TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
_utils.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
builder.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
executor.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
functional.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
graph_rewriting.py Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
logger.py TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
lora_manager.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
mapping.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
module.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
network.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
parameter.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
profiler.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
top_model_mixin.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
version.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00