| .. |
|
_torch
|
[TRTLLM-4406][feat] LLM sleep & wakeup Part 1: virtual device memory (#5034)
|
2025-08-04 13:51:01 +08:00 |
|
api_stability
|
[None][feat] Add support of scheduling attention dp request (#6246)
|
2025-08-01 20:38:01 -04:00 |
|
bindings
|
[nvbugs/5404000] fix: waive request_perf_metrics_draft test on pre-Hopper GPUs (#6339)
|
2025-07-28 12:36:44 -07:00 |
|
disaggregated
|
feat: Dynamically remove servers in PD (#5270)
|
2025-06-25 09:50:04 +08:00 |
|
llmapi
|
[None][fix] Revert commit 48ddc3d & add test for disagg server with different max_num_tokens (#6259)
|
2025-08-04 15:09:51 +08:00 |
|
others
|
test: reorganize tests folder hierarchy (#2996)
|
2025-03-27 12:07:53 +08:00 |
|
scaffolding
|
[https://nvbugs/5387375] fix(scaffolding): fix scaffolding aime test in test_e2e (#6140)
|
2025-07-18 10:34:37 +08:00 |
|
tools
|
enh: Add script to map tests <-> jenkins stages & vice-versa (#5177)
|
2025-07-19 00:50:40 +08:00 |
|
trt
|
Update transformers to 4.53.0 (#5747)
|
2025-07-09 09:32:24 -07:00 |
|
utils
|
[TRTLLM-5826][feat] Support pytorch LoRA adapter eviction (#5616)
|
2025-07-20 08:00:14 +03:00 |
|
conftest.py
|
[feat][test] reuse MPI pool executor across tests (#5566)
|
2025-06-29 17:23:12 +03:00 |
|
dump_checkpoint_stats.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
gc_utils.py
|
[nvbug 5273941] fix: broken cyclic reference detect (#5417)
|
2025-07-01 20:12:55 +08:00 |
|
profile_utils.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
pytest.ini
|
[ci] parallelize torch unittests (#5714)
|
2025-07-09 11:05:57 +03:00 |
|
test_model_runner_cpp.py
|
Update TensorRT-LLM (#2936)
|
2025-03-18 21:25:19 +08:00 |
|
test_pip_install.py
|
Deepseek R1 FP8 Support on Blackwell (#6486)
|
2025-08-01 10:26:28 +08:00 |