TensorRT-LLMs/tensorrt_llm
Kaiyu Xie 0f041b7b57
Update TensorRT-LLM (#1098)
* Update TensorRT-LLM

* update submodule

* Remove unused binaries
2024-02-18 15:48:08 +08:00
..
commands Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
hlapi Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
layers Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
models Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
plugin Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
quantization Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
runtime Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
tools Update TensorRT-LLM (#1055) 2024-02-06 18:38:07 +08:00
__init__.py Update TensorRT-LLM (#1055) 2024-02-06 18:38:07 +08:00
_common.py Update TensorRT-LLM (#1019) 2024-01-31 21:55:32 +08:00
_ipc_utils.py Update TensorRT-LLM (20240116) (#891) 2024-01-16 20:03:11 +08:00
_utils.py Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
builder.py Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
engine.py Update TensorRT-LLM (#941) 2024-01-23 23:22:35 +08:00
executor.py Update TensorRT-LLM (#1019) 2024-01-31 21:55:32 +08:00
functional.py Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
graph_rewriting.py Update TensorRT-LLM (#941) 2024-01-23 23:22:35 +08:00
logger.py Update TensorRT-LLM (#1019) 2024-01-31 21:55:32 +08:00
mapping.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
module.py Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
network.py Update TensorRT-LLM (#941) 2024-01-23 23:22:35 +08:00
parameter.py Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
profiler.py Update TensorRT-LLM (#1019) 2024-01-31 21:55:32 +08:00
top_model_mixin.py Update TensorRT-LLM (#1098) 2024-02-18 15:48:08 +08:00
version.py Update TensorRT-LLM (#1055) 2024-02-06 18:38:07 +08:00