TensorRT-LLMs/jenkins
Ming Wei ed887940d4
infra: open source XQA kernels (#3762)
Replace libtensorrt_llm_nvrtc_wrapper.so with its source code, which
consists of two parts:

1. NVRTC glue code
2. XQA kernel code

During the TensorRT-LLM build, the XQA kernel code is embedded as C++ arrays via
gen_cpp_header.py and passed to NVRTC for JIT compilation.

Signed-off-by: Ming Wei <2345434+ming-wei@users.noreply.github.com>
2025-04-30 18:05:15 +08:00
Build.groovy infra: open source XQA kernels (#3762) 2025-04-30 18:05:15 +08:00
BuildDockerImage.groovy infra: install Triton in the base image (#3759) 2025-04-28 07:36:30 +08:00
controlCCache.groovy infra: install Triton in the base image (#3759) 2025-04-28 07:36:30 +08:00
GH200ImageBuilder.groovy infra: install Triton in the base image (#3759) 2025-04-28 07:36:30 +08:00
L0_MergeRequest.groovy chore: update multi-gpu trigger file list (#3971) 2025-04-30 09:15:26 +08:00
L0_Test.groovy infra: open source XQA kernels (#3762) 2025-04-30 18:05:15 +08:00
license_cpp.json feat: Add support for FP8 MLA on Hopper and Blackwell. (#3190) 2025-04-07 15:14:13 +08:00
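
The commit message above says the XQA kernel sources are embedded as C++ arrays at build time and later handed to NVRTC. The sketch below illustrates that general technique only. It is not the contents of gen_cpp_header.py; the function name to_cpp_array, the variable naming scheme, and the header layout are all hypothetical.

```python
# Hypothetical sketch of "embed a source file as a C++ array".
# This is NOT the real gen_cpp_header.py; names and layout are assumptions.

def to_cpp_array(var_name: str, source: str, per_line: int = 12) -> str:
    """Return C++ header text defining `source` as a NUL-terminated byte array."""
    # Append a NUL terminator so the array can be treated as a C string,
    # which is the form a runtime compiler such as NVRTC expects for source text.
    data = source.encode("utf-8") + b"\0"

    # Emit the bytes as hex literals, a fixed number per row for readability.
    rows = []
    for i in range(0, len(data), per_line):
        chunk = data[i:i + per_line]
        rows.append("    " + ", ".join(f"0x{b:02x}" for b in chunk) + ",")
    body = "\n".join(rows)

    # The size constant counts the trailing NUL byte as well.
    return (
        "#pragma once\n"
        "#include <cstddef>\n\n"
        f"inline constexpr unsigned char {var_name}[] = {{\n{body}\n}};\n"
        f"inline constexpr std::size_t {var_name}_size = sizeof({var_name});\n"
    )


if __name__ == "__main__":
    # Placeholder kernel text, used only to demonstrate the output shape.
    kernel_text = 'extern "C" __global__ void demo_kernel() {}\n'
    print(to_cpp_array("demo_kernel_src", kernel_text))
```

Under these assumptions, the generated header is compiled into the library, so no separate kernel source file has to ship alongside the binary; the runtime glue code would pass the embedded bytes to the JIT compiler as a source string.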