TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Go to file
2025-11-25 03:40:39 +00:00
_cpp_gen Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
_downloads Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
_images Update GitHub pages in root to v1.1.0rc4 2025-09-10 06:17:44 +00:00
_modules Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
_sources Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
_static Insert 1.2.0rc4 into switcher.json 2025-11-25 03:40:38 +00:00
0.18.2 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.19.0 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.19.0rc0 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0 Update GitHub pages to v0.20.0 2025-06-24 03:02:01 +00:00
0.20.0rc0 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0rc1 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0rc2 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0rc3 [Docs] - Fix figure links 2025-05-20 13:00:19 +00:00
0.21.0 Update GitHub pages to v0.21.0 2025-08-04 04:23:03 +00:00
0.21.0rc0 Update GitHub pages to v0.21.0rc0 2025-06-04 02:39:55 +00:00
0.21.0rc1 Update GitHub pages to v0.21.0rc1 2025-06-11 02:46:36 +00:00
0.21.0rc2 Update GitHub pages to v0.21.0rc2 2025-06-18 05:57:01 +00:00
1.0.0 Update GitHub pages to v1.0.0 2025-09-24 02:05:54 +00:00
1.0.0rc0 Update GitHub pages to v1.0.0rc0 2025-06-25 02:49:39 +00:00
1.0.0rc1 Update GitHub pages to v1.0.0rc1 2025-07-01 09:49:03 +00:00
1.0.0rc2 Update GitHub pages to v1.0.0rc2 2025-07-08 02:03:18 +00:00
1.0.0rc3 Update GitHub pages to v1.0.0rc3 2025-07-16 02:09:51 +00:00
1.0.0rc4 Update GitHub pages to v1.0.0rc4 2025-07-22 03:09:08 +00:00
1.0.0rc5 Update GitHub pages to v1.0.0rc5 2025-08-04 06:33:28 +00:00
1.0.0rc6 Update GitHub pages to v1.0.0rc6 2025-08-07 06:26:13 +00:00
1.1.0rc0 Update GitHub pages to v1.1.0rc0 2025-08-15 14:08:04 +00:00
1.1.0rc1 Update GitHub pages to v1.1.0rc1 2025-08-22 08:41:57 +00:00
1.1.0rc2 Update GitHub pages to v1.1.0rc2 2025-08-30 02:33:38 +00:00
1.1.0rc2.post1 Update GitHub pages to v1.1.0rc2.post1 2025-09-05 13:10:33 +00:00
1.1.0rc2.post2 Update GitHub pages to v1.1.0rc2.post2 2025-09-15 00:47:55 +00:00
1.1.0rc3 Update GitHub pages to v1.1.0rc3 2025-09-04 03:19:09 +00:00
1.1.0rc4 Update GitHub pages to v1.1.0rc4 2025-09-10 06:17:42 +00:00
1.1.0rc5 Update GitHub pages to v1.1.0rc5 2025-09-17 05:52:48 +00:00
1.2.0rc0 Update GitHub pages to v1.2.0rc0 2025-09-30 03:07:05 +00:00
1.2.0rc0.post1 Update GitHub pages to v1.2.0rc0.post1 2025-10-14 05:13:54 +00:00
1.2.0rc1 Update GitHub pages to v1.2.0rc1 2025-10-22 01:55:43 +00:00
1.2.0rc2 Update GitHub pages to v1.2.0rc2 2025-11-07 02:24:00 +00:00
1.2.0rc3 Update GitHub pages to v1.2.0rc3 2025-11-21 07:33:25 +00:00
1.2.0rc4 Update GitHub pages to v1.2.0rc4 2025-11-25 03:40:38 +00:00
advanced Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00
architecture Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00
blogs Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
commands Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
deployment-guide Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
dev-on-cloud Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00
developer-guide Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
examples Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
features Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
installation Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
latest Update latest GitHub pages to v1.2.0rc4 2025-11-25 03:40:39 +00:00
legacy Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
llm-api Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
models Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
performance Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00
python-api Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00
reference Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00
scripts/disaggregated Update GitHub pages in root to v1.0.0rc4 2025-07-22 03:09:09 +00:00
torch Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
.buildinfo Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
.nojekyll update gh-pages (#3403) 2025-04-09 14:14:17 +08:00
genindex.html Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
index.html Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
key-features.html Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00
objects.inv Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
overview.html Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
py-modindex.html Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
quick-start-guide.html Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
release-notes.html Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
search.html Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
searchindex.js Update GitHub pages in root to v1.2.0rc4 2025-11-25 03:40:39 +00:00
torch.html Update GitHub pages in root to v1.1.0rc5 2025-09-17 05:52:50 +00:00