Robin Kobus
|
94dd456bd0
|
refactor: Remove speculative decoding parameters from stateful decoders (#3024)

Simplify StatefulGptDecoderBatched constructor:
- Remove speculative decoding mode parameter
- Initialize with default mode=None
- Update GptSession class accordingly

Simplify setup method signatures in StatefulGptDecoder and StatefulGptDecoderBatched:
- Remove maxTokensPerStep parameter
- Initialize decoders with default maxTokensPerStep=1
- Update GptSession class accordingly

Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
|
2025-03-26 20:16:26 +08:00 |
|