TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Go to file
2023-12-04 18:59:41 +08:00
_cpp_gen Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
_downloads update github pages (#540) 2023-12-04 16:26:13 +08:00
_modules Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
_sources Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
_static update github pages (#540) 2023-12-04 16:26:13 +08:00
blogs Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
python-api Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
.nojekyll gh-pages for release/0.5.0 2023-10-19 12:25:48 +00:00
2023-05-17-how-to-add-a-new-model.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
2023-05-19-how-to-debug.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
architecture.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
batch_manager.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
genindex.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
gpt_attention.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
gpt_runtime.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
graph-rewriting.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
index.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
installation.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
memory.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
objects.inv Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
performance.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
precision.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
py-modindex.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
search.html Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00
searchindex.js Update GitHub pages (#544) 2023-12-04 18:59:41 +08:00