mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-13 22:18:36 +08:00
Signed-off-by: Josh Bialkowski <1309820+cheshirekow@users.noreply.github.com> Co-authored-by: Josh Bialkowski <1309820+cheshirekow@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| CMakeLists.txt | ||
| deep_ep_cpp_tllm.version | ||
| nvshmem_fast_build.patch | ||
| nvshmem_src_3.2.5-1.txz | ||
| README.md | ||
| strip_nvshmem_helper.py | ||
How to generate nvshmem_fast_build.patch?
- Build the project without applying the
nvshmem_fast_build.patch. - Link NVSHMEM to DeepEP with one NVSHMEM object file omitted.
- Repeat step 2 until no more object files can be omitted.
- Remove the unused files from NVSHMEM's
CMakelists.txt, and save the differences asnvshmem_fast_build.patch.
The script strip_nvshmem_helper.py automatically performs steps 2 and 3.