This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-14 06:27:45 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
release/1.0
TensorRT-LLMs
/
cpp
/
tests
/
resources
History
Robin Kobus
4cd8543d8c
[TRTLLM-1316] refactor: Remove unnecessary pipeline parallelism logic from postProcessRequest (
#5489
)
...
Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
2025-07-02 10:13:31 +02:00
..
data
[TRTLLM-4460] test: Use Llama 3.2 1B for Llama C++ tests (
#3206
)
2025-05-01 05:31:08 +08:00
models
Initial commit
2023-09-20 00:29:41 -07:00
scripts
[TRTLLM-1316] refactor: Remove unnecessary pipeline parallelism logic from postProcessRequest (
#5489
)
2025-07-02 10:13:31 +02:00
.gitignore
Update TensorRT-LLM (
#2532
)
2024-12-04 21:16:56 +08:00