TensorRT-LLMs/examples/auto_deploy/model_registry/configs
2026-02-09 23:26:51 -05:00
attn_backend_triton.yaml [None][chore] Update AutoDeploy model list (#10505) 2026-01-10 08:47:37 +02:00
compile_backend_torch_cudagraph.yaml [None][chore] Update AD coverage to use torch-cudagraph (#10233) 2025-12-23 07:20:32 -05:00
dashboard_default.yaml [#10246][feature] Move AD dashboard to use cudagraph compile backend (#10267) 2025-12-24 11:09:59 +02:00
demollm_triton.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
gemma3_1b.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
glm-4.7-flash.yaml [#11032][feat] MLA revisited and GLM 4.7 Flash support (#11324) 2026-02-09 23:26:51 -05:00
llama3_3_70b.yaml [#10013][feat] AutoDeploy: native cache manager integration (#10635) 2026-01-27 11:23:22 -05:00
llama4_maverick_lite.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
llama4_scout.yaml [#10013][feat] AutoDeploy: native cache manager integration (#10635) 2026-01-27 11:23:22 -05:00
multimodal.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
num_hidden_layers_5.yaml [None][chore] update model list (#11364) 2026-02-09 21:27:39 +02:00
openelm.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
qwen3_vl.yaml [None][chore] update model list (#11364) 2026-02-09 21:27:39 +02:00
simple_shard_only.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
world_size_1.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
world_size_2.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
world_size_4.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00
world_size_8.yaml [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) 2025-12-19 05:30:02 -08:00