TensorRT-LLMs/tensorrt_llm/_torch/speculative
Fanrong Li 6cbc9a5297
[nvbug/5354946][fix] Fix mtp vanilla draft inputs (#5568)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
2025-06-30 15:59:12 +08:00
__init__.py [TRTLLM-5000][feat] NGrams V2 (#4569) 2025-06-27 23:00:17 +08:00
draft_target.py Speculation: Draft Target in new FW (#4558) 2025-06-17 02:26:08 +08:00
drafter.py [TRTLLM-5000][feat] NGrams V2 (#4569) 2025-06-27 23:00:17 +08:00
eagle3.py Revert "feature: unify new_tokens format sample state to trtllm samper new_tokens format (#4401)" (#5474) 2025-06-25 20:56:04 -07:00
interface.py [TRTLLM-5000][feat] NGrams V2 (#4569) 2025-06-27 23:00:17 +08:00
mtp.py [nvbug/5354946][fix] Fix mtp vanilla draft inputs (#5568) 2025-06-30 15:59:12 +08:00
ngram.py [TRTLLM-5000][feat] NGrams V2 (#4569) 2025-06-27 23:00:17 +08:00
utils.py [TRTLLM-5000][feat] NGrams V2 (#4569) 2025-06-27 23:00:17 +08:00