TensorRT-LLMs/tensorrt_llm/models/redrafter
wili 2e3cf42e03
[refactor] Simplification of Speculative decoding configs (#5639)
Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
Co-authored-by: wili-65535 <wili-65535@users.noreply.github.com>
2025-07-10 11:37:30 -04:00
__init__.py          Update TensorRT-LLM (#1954)                                        2024-07-16 15:30:25 +08:00
drafter.py           Update TensorRT-LLM (#1954)                                        2024-07-16 15:30:25 +08:00
model.py             [refactor] Simplification of Speculative decoding configs (#5639)  2025-07-10 11:37:30 -04:00
redrafter_helper.py  fix: redrafter sampling (#3278)                                    2025-04-08 07:49:32 +08:00