TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
_cpp_gen Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
_downloads Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
_images
_includes Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
_modules Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
_sources Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
_static Insert 1.2.0rc2.post1 into switcher.json 2026-01-09 08:00:34 +00:00
0.18.2
0.19.0
0.19.0rc0
0.20.0
0.20.0rc0
0.20.0rc1
0.20.0rc2
0.20.0rc3
0.21.0
0.21.0rc0
0.21.0rc1
0.21.0rc2
1.0.0
1.0.0rc0
1.0.0rc1
1.0.0rc2
1.0.0rc3
1.0.0rc4
1.0.0rc5
1.0.0rc6
1.1.0 Update GitHub pages to v1.1.0 2025-12-19 05:53:16 +00:00
1.1.0rc0
1.1.0rc1
1.1.0rc2
1.1.0rc2.post1
1.1.0rc2.post2
1.1.0rc3
1.1.0rc4
1.1.0rc5
1.2.0rc0
1.2.0rc0.post1
1.2.0rc1
1.2.0rc2
1.2.0rc2.post1 Update GitHub pages to v1.2.0rc2.post1 2026-01-09 08:00:33 +00:00
1.2.0rc3
1.2.0rc4 Update GitHub pages to v1.2.0rc4 2025-11-25 03:40:38 +00:00
1.2.0rc5 Update GitHub pages to v1.2.0rc5 2025-12-10 03:07:21 +00:00
1.2.0rc6 Update GitHub pages to v1.2.0rc6 2025-12-23 02:41:09 +00:00
1.2.0rc6.post1 Update GitHub pages to v1.2.0rc6.post1 2026-01-07 12:09:20 +00:00
1.2.0rc7 Update GitHub pages to v1.2.0rc7 2026-01-08 05:43:59 +00:00
advanced
architecture
blogs Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
commands Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
deployment-guide Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
dev-on-cloud
developer-guide Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
examples Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
features Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
installation Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
latest Update latest GitHub pages to v1.2.0rc7 2026-01-08 05:44:01 +00:00
legacy Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
llm-api Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
models Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
performance
python-api
reference
scripts/disaggregated
torch Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
.buildinfo Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
.nojekyll
genindex.html Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
index.html Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
key-features.html
objects.inv Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
overview.html Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
py-modindex.html Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
quick-start-guide.html Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
release-notes.html Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
search.html Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
searchindex.js Update GitHub pages in root to v1.2.0rc7 2026-01-08 05:44:03 +00:00
torch.html