TensorRT-LLMs/scripts
Ming Wei ed887940d4
infra: open source XQA kernels (#3762)
Replace libtensorrt_llm_nvrtc_wrapper.so with its source code, which
consists of two parts:

1. NVRTC glue code
2. XQA kernel code

During the TensorRT-LLM build, the XQA kernel code is embedded as C++ arrays
via gen_cpp_header.py and passed to NVRTC for JIT compilation.
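
The embedding step described above can be illustrated with a minimal sketch. This is not the actual gen_cpp_header.py; the function name `to_cpp_array`, the symbol name `xqa_kernel_src`, and the header layout are assumptions made for illustration. It only shows the general technique of turning a kernel source file into a C++ byte array that could later be handed to a runtime compiler such as NVRTC.

```python
def to_cpp_array(symbol: str, source: bytes, per_line: int = 12) -> str:
    """Render `source` as a C++ byte-array definition (hypothetical layout)."""
    # NUL-terminate so the array can be passed where a C string is expected.
    data = source + b"\0"
    rows = []
    for i in range(0, len(data), per_line):
        chunk = data[i:i + per_line]
        rows.append("    " + ", ".join(f"0x{b:02x}" for b in chunk) + ",")
    body = "\n".join(rows)
    # `inline constexpr` variables assume the header is compiled as C++17 or later.
    return (
        "#pragma once\n"
        "#include <cstddef>\n\n"
        f"inline constexpr unsigned char {symbol}[] = {{\n{body}\n}};\n"
        f"inline constexpr std::size_t {symbol}_len = {len(source)};\n"
    )


if __name__ == "__main__":
    # Placeholder kernel text; the real XQA sources are far larger.
    kernel = b'extern "C" __global__ void noop() {}\n'
    print(to_cpp_array("xqa_kernel_src", kernel))
```

Emitting bytes rather than a string literal sidesteps escaping and string-literal length limits in the generated header, at the cost of a less readable file.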

Signed-off-by: Ming Wei <2345434+ming-wei@users.noreply.github.com>
2025-04-30 18:05:15 +08:00
bandit.yaml Update TensorRT-LLM (#1055) 2024-02-06 18:38:07 +08:00
build_cpp_examples.py refactor: Clean up CMakeLists.txt (#3479) 2025-04-18 14:39:29 +08:00
build_wheel.py infra: open source XQA kernels (#3762) 2025-04-30 18:05:15 +08:00
check_test_list.py infra: Add test list name check (#3097) 2025-04-20 23:02:16 +08:00
collect_unittests.py Update (#2978) 2025-03-23 16:39:35 +08:00
dco_check.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
generate_duration.py infra: Add step to generate new duration file (#3298) 2025-04-18 12:56:31 +08:00
git_replace.py Update (#2978) 2025-03-23 16:39:35 +08:00
package_trt_llm.py test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
release_check.py add precommit check to github actions (#3129) 2025-04-11 06:40:53 +08:00
replace_version.sh Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00