TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
2025-05-20 09:23:49 +00:00
_cpp_gen Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
_downloads Fix main page 2025-04-26 05:56:13 +00:00
_images Fix main page 2025-04-26 05:56:13 +00:00
_modules Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
_sources Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
_static fix: gh-pages switcher (#4287) 2025-05-14 12:34:05 +08:00
0.18.2 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.19.0 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.19.0rc0 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0rc0 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0rc1 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0rc2 [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
0.20.0rc3 Update GitHub pages to v0.20.0rc3 2025-05-20 09:23:49 +00:00
advanced Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
architecture Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
blogs Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
commands Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
dev-on-cloud Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
examples Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
installation Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
latest [Docs] - Clean up legacy switcher.json files (#4439) 2025-05-19 22:52:17 +08:00
llm-api Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
performance Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
python-api Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
reference Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
torch Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
.nojekyll update gh-pages (#3403) 2025-04-09 14:14:17 +08:00
genindex.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
index.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
key-features.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
objects.inv Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
overview.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
py-modindex.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
quick-start-guide.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
release-notes.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
search.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
searchindex.js Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00
torch.html Update gh-pages (#4284) 2025-05-14 11:12:52 +08:00