TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

石晓伟 cb04a921a1 correction of typo (#115)		2023-10-25 19:55:42 +08:00
_cpp_gen	correction of typo (#115)	2023-10-25 19:55:42 +08:00
_downloads/29c17f8c7171976309d720e2b031e77e	gh-pages for release/0.5.0	2023-10-19 12:25:48 +00:00
_modules	correction of typo (#115)	2023-10-25 19:55:42 +08:00
_sources	correction of typo (#115)	2023-10-25 19:55:42 +08:00
_static	gh-pages for release/0.5.0	2023-10-19 12:25:48 +00:00
python-api	correction of typo (#115)	2023-10-25 19:55:42 +08:00
.nojekyll	gh-pages for release/0.5.0	2023-10-19 12:25:48 +00:00
2023-05-17-how-to-add-a-new-model.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
2023-05-19-how-to-debug.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
architecture.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
batch_manager.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
genindex.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
gpt_attention.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
gpt_runtime.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
graph-rewriting.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
index.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
installation.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
objects.inv	gh-pages for release/0.5.0	2023-10-19 12:25:48 +00:00
performance.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
precision.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
py-modindex.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
search.html	correction of typo (#115)	2023-10-25 19:55:42 +08:00
searchindex.js	correction of typo (#115)	2023-10-25 19:55:42 +08:00