TensorRT-LLMs/tests/unittest/_torch/speculative
YueWeng a4243f0da5
[TRTLLM-6393][feat] add static tree sampling and verification (#7161)
Signed-off-by: Yue Weng <25103990+yweng0828@users.noreply.github.com>
2025-09-26 13:16:16 -04:00
test_draft_target.py [TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254) 2025-08-27 00:45:58 -04:00
test_draft_token_tree_sampling.py [TRTLLM-6393][feat] add static tree sampling and verification (#7161) 2025-09-26 13:16:16 -04:00
test_draft_token_tree_verification.py [TRTLLM-6393][feat] add static tree sampling and verification (#7161) 2025-09-26 13:16:16 -04:00
test_dynamic_spec_decode.py [None][fix] Assign [] to req.py_draft_tokens instead of None when spec decode is off (#7511) 2025-09-23 06:54:18 -07:00
test_eagle3.py [TRTLLM-7330][feat] Eagle3 cuda graph support for the first draft model inference (#7363) 2025-09-26 11:28:05 +08:00
test_kv_cache_reuse.py [TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254) 2025-08-27 00:45:58 -04:00
test_mtp.py [refactor] Simplification of Speculative decoding configs (#5639) 2025-07-10 11:37:30 -04:00
test_ngram.py [TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254) 2025-08-27 00:45:58 -04:00
test_torch_rejection_sampling.py [None][feat] Add test for speculative rejection sampler (2-model) (#6542) 2025-08-13 22:09:35 -04:00
test_user_provided.py [TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254) 2025-08-27 00:45:58 -04:00