TensorRT-LLM/cpp/tensorrt_llm/runtime/moeLoadBalancer
Latest commit: 31d04dfa12 by Leslie Fang, 2026-01-23 10:16:58 +08:00
[TRTLLM-9108][feat] Add test configurable moe module multi gpu (#10699)
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
| File | Last commit | Date |
| --- | --- | --- |
| gdrwrap.cpp | [None][fix] Allow multi-threaded copy for GDRCopy wrapper (#8535) | 2025-10-23 10:25:04 +08:00 |
| gdrwrap.h | feat: large-scale EP (part 8: Online EP load balancer integration for PCIe fp8) (#5226) | 2025-06-25 22:25:13 -07:00 |
| hostAccessibleDeviceAllocator.cpp | [None][feat] add flag for EPLB to force using GDRCopy (#8650) | 2025-10-29 13:33:26 +08:00 |
| hostAccessibleDeviceAllocator.h | feat: large-scale EP (part 8: Online EP load balancer integration for PCIe fp8) (#5226) | 2025-06-25 22:25:13 -07:00 |
| moeLoadBalancer.cpp | [None][fix] Fix EPLB CPU thread NUMA binding (#8579) | 2025-10-22 10:52:09 -04:00 |
| moeLoadBalancer.h | [TRTLLM-9108][feat] Add test configurable moe module multi gpu (#10699) | 2026-01-23 10:16:58 +08:00 |
| topologyDetector.cpp | feat: large-scale EP (part 8: Online EP load balancer integration for PCIe fp8) (#5226) | 2025-06-25 22:25:13 -07:00 |
| topologyDetector.h | feat: large-scale EP (part 8: Online EP load balancer integration for PCIe fp8) (#5226) | 2025-06-25 22:25:13 -07:00 |