TensorRT-LLMs/cpp
Aurelien Chartier 7b38018fa0
feat: Add numNodes to ParallelConfig (#3346)
* Add numNodes to ParallelConfig

If not provided, attempt to find the number of nodes by
adding the number of local ranks 0

Update device IDs check accordingly

Signed-off-by: Aurelien Chartier <achartier@nvidia.com>

* Add ParallelConfig pickle test

Signed-off-by: Aurelien Chartier <achartier@nvidia.com>

---------

Signed-off-by: Aurelien Chartier <achartier@nvidia.com>
2025-04-13 13:55:04 +02:00
..
cmake fix #3109: early exit cmake if find_library() does not find any lib (#3113) 2025-03-29 19:59:03 +08:00
include/tensorrt_llm feat: Add numNodes to ParallelConfig (#3346) 2025-04-13 13:55:04 +02:00
micro_benchmarks perf: Add optimizations for deepseek in min latency mode (#3093) 2025-04-02 09:05:24 +08:00
tensorrt_llm feat: Add numNodes to ParallelConfig (#3346) 2025-04-13 13:55:04 +02:00
tests refactor: batch slot management in decoder classes (#3300) 2025-04-13 05:05:13 +08:00
CMakeLists.txt chore: Stabilize ABI boundary for internal kernel library (#3117) 2025-04-11 15:07:50 +08:00