TensorRT-LLMs/docs/source/legacy/performance/performance-tuning-guide/index.rst
Guoming Zhang 085271eceb
[None][doc] Clean the doc folder and move the outdated docs into lega… (#7729)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-09-16 11:43:19 +08:00

16 lines
329 B
ReStructuredText

Performance Tuning Guide
=======================
.. include:: introduction.md
:parser: myst_parser.sphinx_
.. toctree::
:maxdepth: 1
benchmarking-default-performance
useful-build-time-flags
tuning-max-batch-size-and-max-num-tokens
deciding-model-sharding-strategy
fp8-quantization
useful-runtime-flags