TensorRT-LLMs/tensorrt_llm
2024-11-05 10:17:16 +00:00
Name                Last commit                                 Date
auto_parallel       open source v0.12-jetson                    2024-11-05 10:17:16 +00:00
bench               TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
commands            TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
hlapi               TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
layers              TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
models              TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
plugin              TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
quantization        TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
runtime             open source v0.12-jetson                    2024-11-05 10:17:16 +00:00
tools               TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
__init__.py         TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
_common.py          TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
_ipc_utils.py       TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
_utils.py           TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
builder.py          TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
executor.py         TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
functional.py       TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
graph_rewriting.py  Update TensorRT-LLM Release branch (#1445)  2024-04-12 17:59:19 +08:00
logger.py           TensorRT-LLM v0.11 Update (#1969)           2024-07-17 20:45:02 +08:00
lora_manager.py     TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
mapping.py          TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
module.py           TensorRT-LLM v0.11 Update (#1969)           2024-07-17 20:45:02 +08:00
network.py          TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
parameter.py        TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
profiler.py         open source v0.12-jetson                    2024-11-05 10:17:16 +00:00
top_model_mixin.py  TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00
version.py          TensorRT-LLM v0.12 Update (#2164)           2024-08-29 17:25:07 +08:00