TensorRT-LLMs/examples/openai_triton
Kaiyu Xie 250d9c293d
Update TensorRT-LLM Release branch (#1445)
* Update TensorRT-LLM

---------

Co-authored-by: Bhuvanesh Sridharan <bhuvan.sridharan@gmail.com>
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
Co-authored-by: Eddie-Wang1120 <wangjinheng1120@163.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
2024-04-12 17:59:19 +08:00
..
manual_plugin Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
plugin_autogen Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
README.md Update 2023-10-10 23:22:17 -07:00

Integration for OpenAI Triton

The typical approach to integrate a kernel into TensorRT-LLM is to create TensorRT plugins. Specially for integrating OpenAI Triton kernels, there are two methods:

  1. Creating TensorRT plugin manually, you can refer to manual plugin example for details,
  2. Generate the TensorRT plugins automatically, please refer to automatic plugin example for details.