TensorRT-LLMs/tensorrt_llm
2024-05-07 23:34:28 +08:00
..
auto_parallel Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
commands Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
hlapi Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
layers Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
models Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
plugin Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
quantization Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
runtime Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
tools Update TensorRT-LLM (#1492) 2024-04-24 14:44:22 +08:00
__init__.py Update TensorRT-LLM (#1492) 2024-04-24 14:44:22 +08:00
_common.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
_ipc_utils.py Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
_utils.py Update TensorRT-LLM (#1530) 2024-04-30 17:19:10 +08:00
builder.py Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
executor.py Update TensorRT-LLM (#1530) 2024-04-30 17:19:10 +08:00
functional.py Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00
graph_rewriting.py Update TensorRT-LLM (#1233) 2024-03-05 18:32:53 +08:00
logger.py Update TensorRT-LLM (#1019) 2024-01-31 21:55:32 +08:00
lora_manager.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
mapping.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
module.py Update TensorRT-LLM (#1168) 2024-02-27 17:37:34 +08:00
network.py Update TensorRT-LLM (#1492) 2024-04-24 14:44:22 +08:00
parameter.py Update TensorRT-LLM (#1168) 2024-02-27 17:37:34 +08:00
profiler.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
top_model_mixin.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
version.py Update TensorRT-LLM (#1554) 2024-05-07 23:34:28 +08:00