TensorRT-LLMs/examples
Kaiyu Xie 9bd15f1937
TensorRT-LLM v0.10 update
* TensorRT-LLM Release 0.10.0

---------

Co-authored-by: Loki <lokravi@amazon.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
2024-06-05 20:43:25 +08:00
apps Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
arctic TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
baichuan TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
bert Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
bindings/executor TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
blip2 Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
bloom TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
chatglm TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
cogvlm TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
cpp/executor TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
cpp_library Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
dbrx TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
enc_dec TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
falcon TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
gemma TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
gpt TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
gptj TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
gptneox TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
high-level-api TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
internlm TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
llama TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
mamba TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
medusa TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
mixtral TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
model_api Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
mpt TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
multimodal TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
nemotron TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
openai_triton Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
opt TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
phi TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
quantization TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
qwen TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
qwenvl TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
recurrentgemma TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
sample_weight_stripping TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
skywork TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
smaug TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
whisper TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
generate_checkpoint_config.py Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
hf_lora_convert.py TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
mmlu.py TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
run.py TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
summarize.py TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
utils.py TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00