TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Zhenhuan Chen e47927e847 [None][fix] change allreduce workspace dtype to torch.int64 to avoid overflow (#9479) Signed-off-by: Zhenhuan Chen <zhenhuanc@nvidia.com>	2025-11-27 17:08:41 +08:00
__init__.py	Update TensorRT-LLM (#1019)	2024-01-31 21:55:32 +08:00
plugin.py	[None][fix] change allreduce workspace dtype to torch.int64 to avoid overflow (#9479)	2025-11-27 17:08:41 +08:00