mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00


Integration for OpenAI Triton

The typical approach to integrating a kernel into TensorRT-LLM is to create a TensorRT plugin. For OpenAI Triton kernels specifically, there are two methods:

1. Create the TensorRT plugin manually; see the manual plugin example (manual_plugin) for details.
2. Generate the TensorRT plugin automatically; see the automatic plugin example (plugin_autogen) for details.