TensorRT-LLMs/examples
2024-01-09 22:48:48 +08:00
baichuan Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
bert Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
blip2 Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
bloom Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
chatglm Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
cpp_library Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
enc_dec Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
falcon Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
gpt Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
gptj Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
gptneox Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
high-level-api init (#848) 2024-01-09 22:48:48 +08:00
internlm Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
llama Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
mixtral Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
module_api Update TensorRT-LLM main branch (#754) 2023-12-27 17:41:24 +08:00
mpt Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
multimodal Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
openai_triton Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
opt Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
phi Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
quantization Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
qwen Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
server Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
whisper Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
mmlu.py Update TensorRT-LLM main branch (#754) 2023-12-27 17:41:24 +08:00
run.py Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
summarize.py Update TensorRT-LLM (#846) 2024-01-09 21:03:35 +08:00
utils.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00