TensorRT-LLMs/latest/_sources/examples/llm_quantization.rst.txt
2025-05-11 22:57:04 -07:00

9 lines
256 B
ReStructuredText

Generation with Quantization
============================
Source https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/llm-api/llm_quantization.py.
.. literalinclude:: ../../../examples/llm-api/llm_quantization.py
:language: python
:linenos: