xavier-nvidia
|
7b818de700
|
Fix TMA error with GEMM+AR on TP=2 (#6071)
Signed-off-by: Xavier Simmons <xsimmons@nvidia.com>
|
2025-07-16 10:27:32 -07:00 |
|
xavier-nvidia
|
60538df1b5
|
Fix GEMM+AR nvbugs 5219533,5127801,5072306 (#5969)
Signed-off-by: xsimmons <xsimmons@nvidia.com>
|
2025-07-11 10:22:02 -07:00 |
|
yunruis
|
30c5b4183a
|
refactoring: port customized kernels with public cutlass version (#5027)
Signed-off-by: yunruis
Merge this to unblock others since the full CI has been run through
|
2025-06-13 16:19:31 +08:00 |
|
Kaiyu Xie
|
2ea17cdad2
|
Update TensorRT-LLM (#2792)
* Update TensorRT-LLM
---------
Co-authored-by: jlee <jungmoolee@clika.io>
|
2025-02-18 21:27:39 +08:00 |
|
Dan Blanaru
|
16d2467ea8
|
Update TensorRT-LLM (#2755)
* Update TensorRT-LLM
---------
Co-authored-by: Denis Kayshev <topenkoff@gmail.com>
Co-authored-by: akhoroshev <arthoroshev@gmail.com>
Co-authored-by: Patrick Reiter Horn <patrick.horn@gmail.com>
Update
|
2025-02-11 03:01:00 +00:00 |
|