TensorRT-LLMs/cpp/tensorrt_llm/common
Wangjue Yao 9f283f330b
[None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309)
Signed-off-by: wjueyao <wyao123@terpmail.umd.edu>
Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-12-19 10:09:51 +08:00
..
assert.cpp
attentionOp.cpp
attentionOp.h
CMakeLists.txt
cublasMMWrapper.cpp
cublasMMWrapper.h
cublasVersionCheck.h
cudaBf16Fallbacks.cuh
cudaBufferUtils.cuh
cudaDriverWrapper.cpp
cudaDriverWrapper.h
cudaFp8Utils.cu
cudaProfilerUtils.cpp
cudaTypeUtils.cuh
customAllReduceUtils.h
envUtils.cpp [None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309) 2025-12-19 10:09:51 +08:00
envUtils.h [None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309) 2025-12-19 10:09:51 +08:00
ipUtils.cpp [None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309) 2025-12-19 10:09:51 +08:00
ipUtils.h [None][feat] Support Mooncake transfer engine as a cache transceiver backend (#8309) 2025-12-19 10:09:51 +08:00
jsonSerializeOptional.h
lamportUtils.cuh
logger.cpp
mathUtils.h
mcastDevMemUtils.cpp
mcastDevMemUtils.h
memoryUtils.cu
memoryUtils.h
ncclUtils.cpp
ncclUtils.h
nvtxUtils.h
opUtils.cpp
opUtils.h
quantTypeUtils.cuh
reduceKernelUtils.cuh
safetensors.cpp
safetensors.h
stlUtils.h
stringUtils.cpp
timestampUtils.cpp
timestampUtils.h
tllmException.cpp
vec_dtypes.cuh
workspace.h