Integration for OpenAI Triton

The typical way to integrate a custom kernel into TensorRT-LLM is to wrap it in a TensorRT plugin. For OpenAI Triton kernels specifically, there are two methods:

  1. Create the TensorRT plugin manually; see the manual plugin example (manual_plugin) for details.
  2. Generate the TensorRT plugin automatically; see the automatic plugin example (plugin_autogen) for details.