TensorRT-LLMs/examples/auto_deploy/model_registry/configs/gemma3_1b.yaml

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-07 03:31:58 +08:00
# Configuration for Gemma 3 1B model
# Specific sequence length requirement due to small attention window
max_seq_len: 511
	`# Configuration for Gemma 3 1B model`
	`# Specific sequence length requirement due to small attention window`
	`max_seq_len: 511`