TensorRT-LLMs/tensorrt_llm
2024-07-26 16:19:24 +08:00
..
auto_parallel Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
commands open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033) 2024-07-26 16:19:24 +08:00
hlapi open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033) 2024-07-26 16:19:24 +08:00
layers open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033) 2024-07-26 16:19:24 +08:00
models open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033) 2024-07-26 16:19:24 +08:00
plugin Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
quantization Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
runtime Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
tools Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
__init__.py Update TensorRT-LLM (#1891) 2024-07-04 14:37:19 +08:00
_common.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
_ipc_utils.py Update TensorRT-LLM (#1918) 2024-07-09 14:42:22 +08:00
_utils.py Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
builder.py open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033) 2024-07-26 16:19:24 +08:00
executor.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
functional.py open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033) 2024-07-26 16:19:24 +08:00
graph_rewriting.py Update TensorRT-LLM (#1233) 2024-03-05 18:32:53 +08:00
logger.py Update TensorRT-LLM (#1891) 2024-07-04 14:37:19 +08:00
lora_manager.py Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
mapping.py Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
module.py Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
network.py Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
parameter.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
profiler.py Update TensorRT-LLM (#1688) 2024-05-28 20:07:49 +08:00
top_model_mixin.py Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
version.py open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033) 2024-07-26 16:19:24 +08:00