TensorRT-LLMs/examples
2025-04-16 14:42:50 +08:00
apps open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
arctic TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
baichuan TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
bert open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
bindings/executor open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
blip2 Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
bloom TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
chatglm TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
cogvlm TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
commandr TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
cpp/executor open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
cpp_library Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
dbrx TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
deepseek_v1 TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
deepseek_v2 TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
dit open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
draft_target_model TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
eagle TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
enc_dec open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
exaone TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
falcon TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
gemma TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
gpt TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
gptj TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
gptneox TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
granite open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
grok TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
infinitebench TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
internlm TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
internlm2 TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
jais TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
llama TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
llm-api open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
lookahead TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
mamba TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
medusa TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
mixtral TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
mllama open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
model_api TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
mpt TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
multimodal open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
nemotron TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
nemotron_nas TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
openai_triton open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
opt TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
phi TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
prompt_lookup TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
python_plugin open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
pytorch open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
quantization TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
qwen TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
qwenvl TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
recurrentgemma TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
redrafter TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
sample_weight_stripping TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
sdxl open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
skywork TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
smaug TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
whisper TensorRT-LLM v0.18.2 release (#3611) 2025-04-16 14:42:50 +08:00
eval_long_context.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
generate_checkpoint_config.py TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
hf_lora_convert.py TensorRT-LLM Release 0.15.0 (#2529) 2024-12-04 13:44:56 +08:00
mmlu_llmapi.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
mmlu.py TensorRT-LLM v0.16 Release 2024-12-24 15:58:43 +08:00
run.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
summarize.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00
utils.py open source 09df54c0cc99354a60bbc0303e3e8ea33a96bef0 (#2725) 2025-02-11 02:21:51 +00:00