TensorRT-LLMs/tensorrt_llm
石晓伟 8f91cff22e
TensorRT-LLM Release 0.15.0 (#2529)
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2024-12-04 13:44:56 +08:00
..
auto_parallel TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
bench TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
commands TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
layers TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
llmapi TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
models TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
plugin TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
quantization TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
runtime TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
serve TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
tools TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
__init__.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
_common.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
_ipc_utils.py TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
_utils.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
builder.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
executor.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
functional.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
graph_rewriting.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
logger.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
lora_manager.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
mapping.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
module.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
network.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
parameter.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
profiler.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
prompt_adapter_manager.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
python_plugin.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
top_model_mixin.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
version.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00