TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Go to file
2024-11-06 14:22:52 +08:00
_cpp_gen Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
_downloads fix documents issues (#2409) 2024-11-04 15:10:33 +08:00
_modules add the missing files (#2418) 2024-11-06 14:22:52 +08:00
_sources fix documents issues (#2409) 2024-11-04 15:10:33 +08:00
_static fix documents issues (#2409) 2024-11-04 15:10:33 +08:00
advanced fix documents issues (#2409) 2024-11-04 15:10:33 +08:00
architecture Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
blogs Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
commands Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
installation Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
llm-api update llm api reference page. (#2410) 2024-11-05 14:01:36 +08:00
llm-api-examples fix documents issues (#2409) 2024-11-04 15:10:33 +08:00
performance fix documents issues (#2409) 2024-11-04 15:10:33 +08:00
python-api Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
reference Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
.nojekyll gh-pages for release/0.5.0 2023-10-19 12:25:48 +00:00
genindex.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
index.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
key-features.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
objects.inv Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
overview.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
py-modindex.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
quick-start-guide.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
release-notes.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
search.html Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00
searchindex.js Update gh-pages (#2404) 2024-11-01 20:31:15 +08:00