Speculative Decoding
====================
Source https://github.com/NVIDIA/TensorRT-LLM/blob/e4c707845ff58fcc0b1d87afb4dd0e64885c780a/examples/llm-api/llm_speculative_decoding.py.

.. literalinclude:: ../../../examples/llm-api/llm_speculative_decoding.py
    :lines: 4-95
    :language: python
    :linenos: