This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-22 03:35:00 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
2f526583fb
TensorRT-LLMs
/
tensorrt_llm
/
commands
History
JunyiXu-nv
b210f22c7e
[
https://nvbugs/5703953
][fix] Preserving ip:port for trtllm-serve before initializing llm (
#9646
)
...
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
2025-12-06 20:13:48 -08:00
..
__init__.py
Update TensorRT-LLM (
#613
)
2023-12-08 17:49:24 +08:00
bench.py
[
#9463
][feat] Add revision option to trtllm commands (
#9498
)
2025-11-27 09:30:01 +08:00
build.py
[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (
#8330
)
2025-10-28 09:17:26 -07:00
eval.py
[None][chore] Fix trtllm-eval and move GroupedGemmInputsHelper (
#9612
)
2025-12-03 07:55:03 +08:00
prune.py
Update TensorRT-LLM (
#2008
)
2024-07-23 23:05:09 +08:00
refit.py
[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (
#8330
)
2025-10-28 09:17:26 -07:00
serve.py
[
https://nvbugs/5703953
][fix] Preserving ip:port for trtllm-serve before initializing llm (
#9646
)
2025-12-06 20:13:48 -08:00