mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-02-07 03:31:58 +08:00
4 lines
123 B
YAML
# Configuration for Gemma 3 1B model
# Specific sequence length requirement due to small attention window
max_seq_len: 511