| .. |
|
attention
|
[TRTLLM-9416][feat] Skip DS-v3.2 indexer MQA and Top-K for short sequences. (#9524)
|
2025-12-15 12:42:25 +08:00 |
|
auto_deploy
|
[#7532][feat] AutoDeploy: gather logits before lm head (#9962)
|
2025-12-17 19:50:13 -08:00 |
|
compilation
|
[TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804)
|
2025-05-09 11:04:01 +08:00 |
|
debugger
|
Fix: fix nvbug 5356427 (#5464)
|
2025-06-25 22:24:26 +08:00 |
|
executor
|
[TRTLLM-5972][chore] Load balance decode token KV cache with helix parallelism (#9757)
|
2025-12-12 22:29:05 +08:00 |
|
misc
|
[TRTLLM-9615][feat] Implement a distributed tuning system (#9621)
|
2025-12-15 21:08:53 +08:00 |
|
modeling
|
[https://nvbugs/5721644][fix] Update tests for nemotron_h (#9993)
|
2025-12-18 12:38:02 +08:00 |
|
models/checkpoints/hf
|
[TRTLLM-7136][feat] Update load_weights method to include mapping parameter in checkpoint loaders (#9583)
|
2025-12-05 16:07:20 +01:00 |
|
modules
|
[FMDL-1222][feat] Support weight and weight_scale padding for NVFP4 MoE cutlass (#9358)
|
2025-12-16 12:41:17 +08:00 |
|
multi_gpu
|
[https://nvbugs/5597647][ci] Unwaive fixed tests. (#9812)
|
2025-12-12 02:29:30 +08:00 |
|
multi_gpu_modeling
|
[https://nvbugs/5515753][ci] Add NCCL_DEBUG=INFO flag to collect more info with CI failure. (#8440)
|
2025-11-20 12:43:13 -05:00 |
|
multimodal
|
[TRTLLM-9601][feat] Expose mmKeys for multimodal to integrate with dynamo. (#9604)
|
2025-12-15 08:42:30 +08:00 |
|
ray_orchestrator
|
[https://nvbugs/5741060][fix] Fix pg op test (#9989)
|
2025-12-17 09:44:25 +08:00 |
|
sampler
|
[https://nvbugs/5708810][fix] Fix TRTLLMSampler (#9710)
|
2025-12-15 23:26:52 +01:00 |
|
speculative
|
[None][fix] Fix iteration stats for spec-dec (#9855)
|
2025-12-16 14:11:38 -08:00 |
|
thop
|
[None][feat] Add routing support for the new model for both cutlass and trtllm moe backend (#9792)
|
2025-12-15 19:59:08 -08:00 |
|
helpers.py
|
[#8733][feat] Add Llama4 MoE handling to AutoDeploy (#9556)
|
2025-12-04 08:03:33 +02:00 |
|
pattern_watcher.py
|
[TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804)
|
2025-05-09 11:04:01 +08:00 |
|
test_connector.py
|
[None][feat] KV Cache Connector API (#7228)
|
2025-08-28 23:09:27 -04:00 |