TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-21 10:15:46 +08:00

History

Ziyi Xiong 420f0fbcf5 [https://nvbugs/5522851 ][fix] Correct the logic to update kv_lens_cuda (#7790 ) Signed-off-by: ziyixiong-nv <219238287+ziyixiong-nv@users.noreply.github.com>		2025-09-19 08:11:29 +08:00
..
test_draft_target.py	[TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254 )	2025-08-27 00:45:58 -04:00
test_dynamic_spec_decode.py	[TRTLLM-6668][feat] Enable overlap scheduler for two-model spec decoding (#7651 )	2025-09-16 07:33:44 +08:00
test_eagle3.py	[https://nvbugs/5522851 ][fix] Correct the logic to update kv_lens_cuda (#7790 )	2025-09-19 08:11:29 +08:00
test_kv_cache_reuse.py	[TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254 )	2025-08-27 00:45:58 -04:00
test_mtp.py	[refactor] Simplification of Speculative decoding configs (#5639 )	2025-07-10 11:37:30 -04:00
test_ngram.py	[TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254 )	2025-08-27 00:45:58 -04:00
test_torch_rejection_sampling.py	[None][feat] Add test for speculative rejection sampler (2-model) (#6542 )	2025-08-13 22:09:35 -04:00
test_user_provided.py	[TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254 )	2025-08-27 00:45:58 -04:00