kanshan/TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-13 22:18:36 +08:00

Venky dfa11d810e

[TRTC-102][docs] --extra_llm_api_options->--config in docs/examples/tests (#10005 )

2025-12-19 13:48:43 -05:00

464 B

Raw Blame History

Recommended LLM API Configuration Settings

This directory contains recommended LLM API performance settings for popular models. They can be used out-of-the-box with trtllm-serve via the --config CLI flag, or you can adjust them to your specific use case.

For model-specific deployment guides, please refer to the official documentation.