TensorRT-LLMs/tensorrt_llm/plugin
Zhenhuan Chen e47927e847
[None][fix] change allreduce workspace dtype to torch.int64 to avoid overflow (#9479)
Signed-off-by: Zhenhuan Chen <zhenhuanc@nvidia.com>
2025-11-27 17:08:41 +08:00
..
__init__.py Update TensorRT-LLM (#1019) 2024-01-31 21:55:32 +08:00
plugin.py [None][fix] change allreduce workspace dtype to torch.int64 to avoid overflow (#9479) 2025-11-27 17:08:41 +08:00