This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-14 06:27:45 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
5506f60037
TensorRT-LLMs
/
tensorrt_llm
/
bench
History
Yan Chunwei
5506f60037
chore [BREAKING CHANGE]: Flatten PyTorchConfig knobs into TorchLlmArgs (
#4603
)
...
Signed-off-by: Superjomn <328693+Superjomn@users.noreply.github.com>
2025-05-28 18:43:04 +08:00
..
benchmark
chore [BREAKING CHANGE]: Flatten PyTorchConfig knobs into TorchLlmArgs (
#4603
)
2025-05-28 18:43:04 +08:00
build
test(perf): Add some
Llama-3_3-Nemotron-Super-49B-v1
integration-perf-tests (TRT flow, trtllm-bench) (
#4128
)
2025-05-19 12:00:48 -07:00
dataclasses
chore [BREAKING CHANGE]: Flatten PyTorchConfig knobs into TorchLlmArgs (
#4603
)
2025-05-28 18:43:04 +08:00
utils
[TRTLLM-5054][fix] Removing repeated loading of input processor (
#4161
)
2025-05-16 08:04:58 +08:00
__init__.py
Update TensorRT-LLM
2024-08-20 18:55:15 +08:00