TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-05 02:31:33 +08:00

History

Liao Lanyu fef0e4b17d [TRTLLM-10666][chore] Refactor request fetching logic for better separation of concerns (#10988 ) Signed-off-by: Lanyu Liao <lancelly@users.noreply.github.com> Signed-off-by: Lance Liao <108499334+lancelly@users.noreply.github.com> Signed-off-by: Liao Lanyu <108499334+lancelly@users.noreply.github.com> Co-authored-by: Lanyu Liao <lancelly@users.noreply.github.com>		2026-02-02 10:36:08 +08:00
..
test_async_transfer_manager.py	[None][chore] Async Transfer Manager (#9891 )	2026-01-20 12:12:47 -05:00
test_chunked_logits.py	[TRTLLM-8031][feat] Add chunked return_generation_logits logic (#7831 )	2025-10-01 12:47:07 -04:00
test_executor_request_queue.py	[TRTLLM-10666][chore] Refactor request fetching logic for better separation of concerns (#10988 )	2026-02-02 10:36:08 +08:00
test_overlap_scheduler_input.json	[TRTLLM-9295][fix] unflake test_overlap_scheduler.py::test_overlap_scheduler_consis… (#9146 )	2025-11-14 11:36:22 +01:00
test_overlap_scheduler.py	[TRTLLM-10029][scheduler] Re-implement MicroBatchScheduler and CapacityScheduler in Python (#10273 )	2026-01-20 10:31:13 +08:00
test_py_executor.py	[TRTLLM-10666][chore] Refactor request fetching logic for better separation of concerns (#10988 )	2026-02-02 10:36:08 +08:00
test_pytorch_model_engine.py	[https://nvbugs/5717993 ][fix] Add execution_stream across PyExecutor, KVCacheManager, PeftCacheManager to ensure proper CUDA stream synchronization between KV cache transfer operations and model forward kernels. (#10060 )	2025-12-31 09:22:54 -08:00
test_request_utils.py	[TRTLLM-10666][chore] Refactor request fetching logic for better separation of concerns (#10988 )	2026-02-02 10:36:08 +08:00
test_resource_manager.py	[https://nvbugs/5717993 ][fix] Add execution_stream across PyExecutor, KVCacheManager, PeftCacheManager to ensure proper CUDA stream synchronization between KV cache transfer operations and model forward kernels. (#10060 )	2025-12-31 09:22:54 -08:00
test_router_dealer_ipc.py	[https://nvbugs/5503440 ][fix] Fix potential hang due to wrong type of ZMQ socket and protocol for worker_init_status_queue (#7646 )	2025-09-19 18:13:33 +08:00
test_scheduler_serializable_output.py	[https://nvbugs/5677746 ][fix] Use first PP rank's schedule result in other PP ranks to fix PP hang (#9659 )	2025-12-08 18:43:52 -08:00