TensorRT-LLMs/tensorrt_llm/_torch/speculative/drafter.py
Mike Iovine fa34cb7234
[refactor] Clean up drafter/resource manager creation logic (#5805)
Signed-off-by: Mike Iovine <6158008+mikeiovine@users.noreply.github.com>
2025-07-16 12:45:46 -07:00

17 lines
371 B
Python

from abc import ABC, abstractmethod
from ..pyexecutor.scheduler import ScheduledRequests
class Drafter(ABC):
@abstractmethod
def prepare_draft_tokens(
self,
scheduled_requests: ScheduledRequests,
) -> None:
"""
Prepare the drafter tokens for the forward computation this step.
"""
raise NotImplementedError