TensorRT-LLMs/cpp/tensorrt_llm/kernels/communicationKernels
Void 950cadf2bd
add support for smaller hidden_dim (#3609)
Signed-off-by: Yilin Zhang <18275976+yilin-void@users.noreply.github.com>
Co-authored-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-04-17 12:00:32 +08:00
..
allReduceFusionKernels.cu add support for smaller hidden_dim (#3609) 2025-04-17 12:00:32 +08:00
allReduceFusionKernels.h feat: fix and improve allreduce and fusion kernels (#3064) 2025-04-08 19:33:52 +08:00
allReduceWorkspace.cu feat: fix and improve allreduce and fusion kernels (#3064) 2025-04-08 19:33:52 +08:00
allReduceWorkspace.h feat: fix and improve allreduce and fusion kernels (#3064) 2025-04-08 19:33:52 +08:00
moeAllReduceFusionKernels.cu feat: fix and improve allreduce and fusion kernels (#3064) 2025-04-08 19:33:52 +08:00
moeAllReduceFusionKernels.h feat: fix and improve allreduce and fusion kernels (#3064) 2025-04-08 19:33:52 +08:00