TensorRT-LLMs/tensorrt_llm/models
Sharan Chetlur 258c7540c0 open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725)
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>

open source f8c0381a2bc50ee2739c3d8c2be481b31e5f00bd (#2736)

Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>

Add note for blackwell (#2742)

Update the docs to workaround the extra-index-url issue (#2744)

update README.md (#2751)

Fix github io pages (#2761)

Update
2025-02-11 02:21:51 +00:00
..
baichuan TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
bert TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
bloom TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
chatglm TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
cogvlm TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
commandr TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
dbrx TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
deepseek_v1 open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
deepseek_v2 open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
dit TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
eagle open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
enc_dec TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
falcon TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
gemma open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
gpt TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
gptj TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
gptneox TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
grok TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
llama open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
mamba TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
medusa open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
mllama open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
mpt TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
nemotron_nas open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
opt TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
phi TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
phi3 open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
qwen open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
recurrentgemma TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
redrafter TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
unet TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
__init__.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
automodel.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
convert_utils.py TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
generation_mixin.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
model_weights_loader.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
modeling_utils.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00