| Name | Last commit | Date |
| --- | --- | --- |
| batch_manager | [None][feat] Support NVFP4 KV Cache (#6244) | 2025-09-01 09:24:52 +08:00 |
| common | [None][feat] Support NVFP4 KV Cache (#6244) | 2025-09-01 09:24:52 +08:00 |
| cutlass_extensions/include/cutlass_extensions | [None][perf] Disable Swap AB when num tokens exceeds N dimension (#7104) | 2025-08-28 21:29:55 -04:00 |
| deep_ep | [None][feat] DeepEP LL combine FP4 (#6822) | 2025-08-13 04:20:21 -04:00 |
| deep_gemm | [https://nvbugs/5433581][fix] DeepGEMM installation on SBSA (#6588) | 2025-08-06 16:44:21 +08:00 |
| executor | [None][feat] Update TargetInfo to accommodate CP in disagg (#7224) | 2025-08-29 15:56:20 -04:00 |
| executor_worker | Update TensorRT-LLM (#2792) | 2025-02-18 21:27:39 +08:00 |
| kernels | [None][feat] Support NVFP4 KV Cache (#6244) | 2025-09-01 09:24:52 +08:00 |
| layers | refactor: Remove enforced sorted order of batch slots (#3502) | 2025-07-14 17:23:02 +02:00 |
| nanobind | [None][fix] Fix nanobind failure (#7425) | 2025-09-01 17:26:40 -04:00 |
| plugins | [TRTLLM-7319][perf] Fuse slicing into MoE. (#6728) | 2025-08-25 16:52:30 -04:00 |
| pybind | [None][feat] Support NVFP4 KV Cache (#6244) | 2025-09-01 09:24:52 +08:00 |
| runtime | [None][feat] KV Cache Connector API (#7228) | 2025-08-28 23:09:27 -04:00 |
| testing | fix: Improve chunking test and skip empty kernel calls (#5710) | 2025-07-04 09:08:15 +02:00 |
| thop | [https://nvbugs/5412562][feat] Allocate MoE workspace only when necessary (release/1.0 retargeted) (#6955) | 2025-09-01 11:02:31 +08:00 |
| CMakeLists.txt | [https://nvbugs/5453827][fix] Fix RPATH of th_common shared library to find pip-installed NCCL (#6984) | 2025-08-21 17:58:30 +08:00 |