TensorRT-LLMs/tests/unittest/_torch/executor
Balaram Buddharaju af315d8ef1
[TRTLLM-5972][chore] Load balance decode token KV cache with helix parallelism (#9757)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2025-12-12 22:29:05 +08:00
test_chunked_logits.py [TRTLLM-8031][feat] Add chunked return_generation_logits logic (#7831) 2025-10-01 12:47:07 -04:00
test_executor_request_queue.py [TRTLLM-8650][fix] beam search request validation (#8433) (#9228) 2025-11-21 04:08:45 -08:00
test_overlap_scheduler_input.json [TRTLLM-9295][fix] unflake test_overlap_scheduler.py::test_overlap_scheduler_consis… (#9146) 2025-11-14 11:36:22 +01:00
test_overlap_scheduler.py [None][refactor] Improve request processing function in sampler (#9671) 2025-12-05 16:41:49 +01:00
test_pytorch_model_engine.py [TRTLLM-5972][chore] Load balance decode token KV cache with helix parallelism (#9757) 2025-12-12 22:29:05 +08:00
test_resource_manager.py [TRTLLM-8511][feat] Add update_weights and sleep_wakeup support for rl integration (#8302) 2025-11-04 10:19:24 -08:00
test_router_dealer_ipc.py [https://nvbugs/5503440][fix] Fix potential hang due to wrong type of ZMQ socket and protocol for worker_init_status_queue (#7646) 2025-09-19 18:13:33 +08:00
test_scheduler_serializable_output.py [https://nvbugs/5677746][fix] Use first PP rank's schedule result in other PP ranks to fix PP hang (#9659) 2025-12-08 18:43:52 -08:00