mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-14 06:27:45 +08:00
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com> Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com> Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
23 lines
649 B
ReStructuredText
=======================================================
LLM Examples Introduction
=======================================================

The following quickstart example shows how to use the LLM API with TinyLlama.

.. literalinclude:: ../../../examples/llm-api/quickstart_example.py
    :language: python
    :linenos:

The LLM API can be used for both offline and online usage. See more examples of the LLM API here:

.. toctree::
   :maxdepth: 1
   :caption: LLM API Examples

   %EXAMPLE_DOCS%

For more details on how to make full use of this API, see:

* `Common customizations <customization.html>`_
* `LLM API Reference <../llm-api/index.html>`_