This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-14 06:27:45 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
e257cb3533
TensorRT-LLMs
/
tensorrt_llm
/
_torch
/
distributed
History
Shiyu Li
b093d94d34
[
https://nvbugs/5445466
][fix] Bypass MLP TP split for MNNVL in DeepSeek V3 to avoid hanging. (
#6886
)
...
Signed-off-by: Shiyu Li <shili@nvidia.com>
2025-08-28 15:17:48 -07:00
..
__init__.py
[TRTLLM-3927] [feat] Finalize + Allreduce + add + rmsnorm fusion (
#4756
)
2025-06-10 19:55:16 +08:00
communicator.py
feat: Skip sampler for intermediate pp stages. (
#4514
)
2025-05-26 10:08:51 +08:00
ops.py
[
https://nvbugs/5445466
][fix] Bypass MLP TP split for MNNVL in DeepSeek V3 to avoid hanging. (
#6886
)
2025-08-28 15:17:48 -07:00