mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-14 06:27:45 +08:00
Replace libtensorrt_llm_nvrtc_wrapper.so with its source code, which consists of two parts:
1. NVRTC glue code
2. XQA kernel code

During the TensorRT-LLM build, the XQA kernel code is embedded as C++ arrays via gen_cpp_header.py and passed to NVRTC for JIT compilation.

Signed-off-by: Ming Wei <2345434+ming-wei@users.noreply.github.com>
8 lines
232 B
Makefile
test_nvrtc:
	(cd .. && python3 gen_cpp_header.py) && g++ test_nvrtc.cpp -I/usr/local/cuda/include -I../generated -L/usr/local/cuda/lib64 -o test_nvrtc -lnvrtc -lcuda -lcudart

test: test_nvrtc
	./test_nvrtc

.PHONY: test_nvrtc test
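The test_nvrtc target runs gen_cpp_header.py before compiling, and the commit message says this step embeds the XQA kernel code as C++ arrays. The following is a minimal sketch of what such an embedding step might look like. It is not the actual gen_cpp_header.py: the function name `to_cpp_array`, the array naming scheme, the trailing NUL byte, and the header layout are all assumptions made for illustration.

```python
def to_cpp_array(name: str, data: bytes, per_line: int = 12) -> str:
    """Render `data` as a C++ header defining a byte array and its length.

    Hypothetical helper: the real gen_cpp_header.py may use a different
    layout, naming scheme, or element type.
    """
    # Assumption: append a NUL byte so the array can also be read as a
    # C string, which is the form a runtime compiler typically expects.
    payload = data + b"\0"
    rows = []
    for start in range(0, len(payload), per_line):
        chunk = payload[start:start + per_line]
        rows.append("    " + ", ".join(f"0x{b:02x}" for b in chunk) + ",")
    body = "\n".join(rows)
    return (
        "#pragma once\n"
        "#include <cstddef>\n\n"
        f"static const unsigned char {name}[] = {{\n{body}\n}};\n"
        # The length constant excludes the NUL terminator added above.
        f"static const std::size_t {name}_len = {len(data)};\n"
    )


if __name__ == "__main__":
    # Illustrative kernel text only; not taken from the XQA sources.
    kernel_source = b'extern "C" __global__ void noop() {}\n'
    print(to_cpp_array("demo_kernel_src", kernel_source))
```

Emitting the source as a byte array rather than a string literal avoids escaping issues and compiler limits on string literal length, which is one plausible reason to prefer this form for large kernel files.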