TensorRT-LLMs/tests/unittest/_torch/executor
Mike Iovine b3c57a7042
[TRTLLM-7353][feat] Implement capturable drafting loops for speculation (#7100)
Signed-off-by: Mike Iovine <6158008+mikeiovine@users.noreply.github.com>
2025-09-01 14:37:44 -04:00
..
test_executor_request_queue.py [None][opt] Balance the request based on number of tokens in AttentionDP (#7183) 2025-08-27 11:16:12 +08:00
test_overlap_scheduler_input.json [None][ci] move unittests to sub-directories (#6635) 2025-08-20 05:42:22 -04:00
test_overlap_scheduler.py [None][ci] move unittests to sub-directories (#6635) 2025-08-20 05:42:22 -04:00
test_pytorch_model_engine.py [TRTLLM-7353][feat] Implement capturable drafting loops for speculation (#7100) 2025-09-01 14:37:44 -04:00
test_resource_manager.py fix/improve kvcache allocation in PyTorch runtime (#5933) 2025-08-26 12:40:22 +08:00