TensorRT-LLMs/tensorrt_llm/models/medusa
wili 2e3cf42e03
[refactor] Simplification of Speculative decoding configs (#5639)
Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
Co-authored-by: wili-65535 <wili-65535@users.noreply.github.com>
2025-07-10 11:37:30 -04:00
..
__init__.py Update TensorRT-LLM (20240116) (#891) 2024-01-16 20:03:11 +08:00
config.py [refactor] Simplification of Speculative decoding configs (#5639) 2025-07-10 11:37:30 -04:00
model.py [refactor] Simplification of Speculative decoding configs (#5639) 2025-07-10 11:37:30 -04:00
weight.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00