This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-08 12:12:33 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
9bfb6179ec
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
common
History
Ludwig Schneider
41ce14ab04
[None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce (
#9314
)
...
Signed-off-by: Ludwig Schneider <lschneider@nvidia.com>
2025-12-07 09:43:26 -08:00
..
assert.cpp
attentionOp.cpp
[None][fix] Remove FP8 K/V buffer from TRTLLM sparse MLA attention kernel (
#9529
)
2025-11-28 15:26:52 +08:00
attentionOp.h
[TRTLLM-8778][feat] Add tree attention support for blackwell arch (
#8975
)
2025-11-17 09:01:53 +08:00
CMakeLists.txt
[
https://nvbugs/5451205
][feat] Add cuBLASLt NVFP4 GEMM backend support (
#7943
)
2025-10-23 15:55:10 +08:00
cublasMMWrapper.cpp
[
https://nvbugs/5451205
][feat] Add cuBLASLt NVFP4 GEMM backend support (
#7943
)
2025-10-23 15:55:10 +08:00
cublasMMWrapper.h
[
https://nvbugs/5451205
][feat] Add cuBLASLt NVFP4 GEMM backend support (
#7943
)
2025-10-23 15:55:10 +08:00
cublasVersionCheck.h
cudaBf16Fallbacks.cuh
cudaBufferUtils.cuh
cudaDriverWrapper.cpp
cudaDriverWrapper.h
cudaFp8Utils.cu
cudaProfilerUtils.cpp
cudaTypeUtils.cuh
customAllReduceUtils.h
[None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce (
#9314
)
2025-12-07 09:43:26 -08:00
envUtils.cpp
[None][feat] Add TRTLLM_NIXL_KVCACHE_BACKEND environment variable for NIXL backend selection (
#9075
)
2025-11-17 15:39:55 -08:00
envUtils.h
[None][feat] Add TRTLLM_NIXL_KVCACHE_BACKEND environment variable for NIXL backend selection (
#9075
)
2025-11-17 15:39:55 -08:00
jsonSerializeOptional.h
lamportUtils.cuh
[None][feat] MNNVLAllreduce Kernel Refactor (
#8018
)
2025-11-05 08:49:47 +08:00
logger.cpp
mathUtils.h
mcastDevMemUtils.cpp
mcastDevMemUtils.h
memoryUtils.cu
memoryUtils.h
ncclUtils.cpp
[None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce (
#9314
)
2025-12-07 09:43:26 -08:00
ncclUtils.h
[None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce (
#9314
)
2025-12-07 09:43:26 -08:00
nvtxUtils.h
opUtils.cpp
[None][feat] Enable NCCL_SYMMETRIC as default fallback for AllReduce (
#9314
)
2025-12-07 09:43:26 -08:00
opUtils.h
quantTypeUtils.cuh
reduceKernelUtils.cuh
safetensors.cpp
safetensors.h
stlUtils.h
stringUtils.cpp
timestampUtils.cpp
timestampUtils.h
tllmException.cpp
[None][feat] Add Request specific exception (
#6931
)
2025-09-04 18:43:42 -04:00
vec_dtypes.cuh
[TRTLLM-7318][feat] MnnvlThroughput AlltoAll implementation. (
#7499
)
2025-10-27 13:23:06 -04:00
workspace.h