TensorRT-LLMs/examples
dongxuy04 5f26e44ead
open source 3706e7395b9b58994412617992727c8ff2d14c9f (#2010)
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2024-07-24 05:48:06 +08:00
..
apps Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
arctic Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
baichuan Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
bert Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
bindings/executor Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
blip2 Update TensorRT-LLM (#1168) 2024-02-27 17:37:34 +08:00
bloom Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
chatglm Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
cogvlm Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
cpp/executor Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
cpp_library Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
dbrx Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
dit Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
enc_dec Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
falcon Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
gemma Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
gpt Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
gptj Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
gptneox Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
grok Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
high-level-api Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
infinitebench Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
internlm Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
internlm2 Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
jais Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
llama open source 3706e7395b9b58994412617992727c8ff2d14c9f (#2010) 2024-07-24 05:48:06 +08:00
mamba Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
medusa Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
mixtral Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
model_api Update TensorRT-LLM (#1891) 2024-07-04 14:37:19 +08:00
mpt Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
multimodal Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
nemotron Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
openai_triton Update TensorRT-LLM (#1891) 2024-07-04 14:37:19 +08:00
opt Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
phi Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
quantization Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
qwen Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
qwenvl Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
recurrentgemma Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
redrafter Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
sample_weight_stripping Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
skywork Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
smaug Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
whisper Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
eval_long_context.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
generate_checkpoint_config.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
hf_lora_convert.py Update TensorRT-LLM (#1891) 2024-07-04 14:37:19 +08:00
mmlu.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
run.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
summarize.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
utils.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00