TensorRT-LLMs/tensorrt_llm
Kaiyu Xie 3aa6b11d13
Update TensorRT-LLM (#2936)
* Update TensorRT-LLM

---------

Co-authored-by: changcui <cuichang147@gmail.com>
2025-03-18 21:25:19 +08:00
..
_torch Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
auto_parallel Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
bench Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
commands Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
executor Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
inputs Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
layers Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
llmapi Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
models Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
plugin Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
quantization Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
runtime Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
scaffolding Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
serve Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
tools Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
__init__.py Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00
_common.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
_ipc_utils.py Update TensorRT-LLM (#2783) 2025-02-13 18:40:22 +08:00
_utils.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
builder.py Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00
disaggregated_params.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
functional.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
graph_rewriting.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
logger.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
lora_manager.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
mapping.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
module.py Update TensorRT-LLM (#2253) 2024-09-24 17:27:31 +02:00
network.py Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00
parameter.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
profiler.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
prompt_adapter_manager.py Update TensorRT-LLM (#2333) 2024-10-15 15:28:40 +08:00
python_plugin.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
sampling_params.py Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
top_model_mixin.py Update TensorRT-LLM (#2053) 2024-07-30 21:25:01 +08:00
version.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00