TensorRT-LLMs/tensorrt_llm
石晓伟 b8fc6633ba
Update TensorRT-LLM (#2156)
Co-authored-by: Bruno Magalhaes <bruno.magalhaes@synthesia.io>
2024-08-27 18:20:59 +08:00
..
auto_parallel Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
bench Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
commands Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
hlapi Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
layers Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
models Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
plugin Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
quantization Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
runtime Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
tools Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00
__init__.py Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
_common.py Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00
_ipc_utils.py Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
_utils.py Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
builder.py Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
executor.py Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
functional.py Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
graph_rewriting.py Update TensorRT-LLM (#1233) 2024-03-05 18:32:53 +08:00
logger.py Update TensorRT-LLM (#1891) 2024-07-04 14:37:19 +08:00
lora_manager.py Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
mapping.py Update TensorRT-LLM (#2053) 2024-07-30 21:25:01 +08:00
module.py Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
network.py Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00
parameter.py Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
profiler.py Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
top_model_mixin.py Update TensorRT-LLM (#2053) 2024-07-30 21:25:01 +08:00
version.py Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00