| Name | Last commit | Date |
| --- | --- | --- |
| attention | [TRTLLM-9416][feat] Skip DS-v3.2 indexer MQA and Top-K for short sequences. (#9524) | 2025-12-15 12:42:25 +08:00 |
| auto_deploy | [#7532][feat] AutoDeploy: gather logits before lm head (#9962) | 2025-12-17 19:50:13 -08:00 |
| compilation | [TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804) | 2025-05-09 11:04:01 +08:00 |
| debugger | Fix: fix nvbug 5356427 (#5464) | 2025-06-25 22:24:26 +08:00 |
| executor | [TRTLLM-5972][chore] Load balance decode token KV cache with helix parallelism (#9757) | 2025-12-12 22:29:05 +08:00 |
| misc | [TRTLLM-9615][feat] Implement a distributed tuning system (#9621) | 2025-12-15 21:08:53 +08:00 |
| modeling | [https://nvbugs/5721644][fix] Update tests for nemotron_h (#9993) | 2025-12-18 12:38:02 +08:00 |
| models/checkpoints/hf | [TRTLLM-7136][feat] Update load_weights method to include mapping parameter in checkpoint loaders (#9583) | 2025-12-05 16:07:20 +01:00 |
| modules | [None][fix] avoid ID conversion for non enable_configurable_moe cases. (#10003) | 2025-12-18 13:29:52 +08:00 |
| multi_gpu | [https://nvbugs/5597647][ci] Unwaive fixed tests. (#9812) | 2025-12-12 02:29:30 +08:00 |
| multi_gpu_modeling | [https://nvbugs/5515753][ci] Add NCCL_DEBUG=INFO flag to collect more info with CI failure. (#8440) | 2025-11-20 12:43:13 -05:00 |
| multimodal | [TRTLLM-9601][feat] Expose mmKeys for multimodal to integrate with dynamo. (#9604) | 2025-12-15 08:42:30 +08:00 |
| ray_orchestrator | [https://nvbugs/5741060][fix] Fix pg op test (#9989) | 2025-12-17 09:44:25 +08:00 |
| sampler | [https://nvbugs/5708810][fix] Fix TRTLLMSampler (#9710) | 2025-12-15 23:26:52 +01:00 |
| speculative | Fix thread leak for test_draft_len_schedule. Enhance stability for test_spec_gate. | 2025-12-19 02:01:38 +00:00 |
| thop | [https://nvbugs/5456493][feat] Add fp8 bmm on sm120 (#9687) | 2025-12-18 22:57:20 +08:00 |
| helpers.py | [#8733][feat] Add Llama4 MoE handling to AutoDeploy (#9556) | 2025-12-04 08:03:33 +02:00 |
| pattern_watcher.py | [TRTLLM-3105][feat] Add Piecewise CUDA Graph Support (#3804) | 2025-05-09 11:04:01 +08:00 |
| test_connector.py | [None][feat] KV Cache Connector API (#7228) | 2025-08-28 23:09:27 -04:00 |