TensorRT-LLMs/tensorrt_llm/models
2024-11-01 19:48:44 +08:00
baichuan TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
bert TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
bloom TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
chatglm Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
cogvlm TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
dbrx TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
deepseek_v1 Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
dit TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
enc_dec Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
falcon Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
gemma Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
gpt Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
gptj TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
gptneox TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
grok Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
llama Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
mamba Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
medusa TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
mpt TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
nemotron_nas Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
opt TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
phi TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
phi3 Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
qwen Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
recurrentgemma Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
redrafter Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
unet Update TensorRT-LLM Release branch (#1192) 2024-02-29 17:20:55 +08:00
__init__.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
automodel.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
convert_utils.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
generation_mixin.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
model_weights_loader.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
modeling_utils.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00