TensorRT-LLMs/examples/auto_deploy/model_registry/configs/gemma3_1b.yaml
tcherckez-nvidia 9f6abaf59f
[#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836)
Signed-off-by: Tal Cherckez <127761168+tcherckez-nvidia@users.noreply.github.com>
2025-12-19 05:30:02 -08:00

4 lines
123 B
YAML

# Configuration for Gemma 3 1B model
# Specific sequence length requirement due to small attention window
max_seq_len: 511