TensorRT-LLMs/Makefile at 89336fbf07422b034af144b86ecfb415aaa64372 - TensorRT-LLMs - Gitea: Git with a cup of tea

kanshan/TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Ming Wei ed887940d4

infra: open source XQA kernels (#3762 )

Replace libtensorrt_llm_nvrtc_wrapper.so with its source code, which
consists of two parts:

1. NVRTC glue code
2. XQA kernel code

During TensorRT-LLM build, XQA kernel code is embedded as C++ arries via
gen_cpp_header.py and passed to NVRTC for JIT compilation.

Signed-off-by: Ming Wei <2345434+ming-wei@users.noreply.github.com>

2025-04-30 18:05:15 +08:00

8 lines

232 B

Makefile

Raw Blame History

 test_nvrtc:
 	(cd .. && python3 gen_cpp_header.py) && g++ test_nvrtc.cpp -I/usr/local/cuda/include -I../generated -L/usr/local/cuda/lib64 -o test_nvrtc -lnvrtc -lcuda -lcudart
 test: test_nvrtc
 	./test_nvrtc
 .PHONY: test_nvrtc test