TensorRT-LLMs/_sources/examples/llm_kv_cache_offloading.rst.txt
2025-11-25 03:40:39 +00:00

9 lines
305 B
ReStructuredText

KV Cache Offloading
===================
Source https://github.com/NVIDIA/TensorRT-LLM/blob/a761585d9c15b4c1249aaf65a8f90764efa83a3c/examples/llm-api/llm_kv_cache_offloading.py.
.. literalinclude:: ../../../examples/llm-api/llm_kv_cache_offloading.py
:lines: 4-134
:language: python
:linenos: