TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Go to file
Kaiyu Xie ace2b6917b
Add multi-version documents (#3861)
* Add 0.18.2

Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>

* Remove doctree

Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>

* Add 0.19.0rc0

Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>

* Add 0.20.0rc0

Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>

* Add latest

Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>

---------

Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2025-04-25 10:35:43 -07:00
_cpp_gen Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
_downloads Update docs (#2732) 2025-02-11 02:56:32 +00:00
_images Update docs (#2732) 2025-02-11 02:56:32 +00:00
_modules Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
_sources Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
_static Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
0.18.2 Add multi-version documents (#3861) 2025-04-25 10:35:43 -07:00
0.19.0rc0 Add multi-version documents (#3861) 2025-04-25 10:35:43 -07:00
0.20.0rc0 Add multi-version documents (#3861) 2025-04-25 10:35:43 -07:00
advanced Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
architecture Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
blogs Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
commands Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
installation Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
latest Add multi-version documents (#3861) 2025-04-25 10:35:43 -07:00
llm-api Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
llm-api-examples Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
performance Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
python-api Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
reference Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
torch Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
.nojekyll update gh-pages (#3403) 2025-04-09 14:14:17 +08:00
genindex.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
index.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
key-features.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
objects.inv Update gh-pages (#3242) 2025-04-02 22:12:51 +08:00
overview.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
py-modindex.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
quick-start-guide.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
release-notes.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
search.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
searchindex.js Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00
torch.html Update GitHub pages to v0.18.2 (#3624) 2025-04-16 18:17:04 +08:00