This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-29 23:23:48 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
79e0296ca0
TensorRT-LLMs
/
tests
/
unittest
/
_torch
/
auto_deploy
History
Grzegorz Kwasniewski
3755f8ab7d
[TRTLLM-6342][fix] Fixed triggering BMM sharding (
#7389
)
...
Signed-off-by: greg-kwasniewski1 <213329731+greg-kwasniewski1@users.noreply.github.com>
2025-09-04 02:01:27 -04:00
..
_utils_test
[
#4403
][autodeploy] Refactor: Move more transformations to new inf optimizer, Add quantization_source to factory interface (
#6760
)
2025-08-11 22:02:46 -07:00
integration
[None][doc] Update autodeploy README.md, deprecate lm_eval in examples folder (
#7233
)
2025-08-26 10:47:57 -07:00
unit
[TRTLLM-6342][fix] Fixed triggering BMM sharding (
#7389
)
2025-09-04 02:01:27 -04:00