TensorRT-LLMs/cpp/tensorrt_llm
Aurelien Chartier 7b38018fa0
feat: Add numNodes to ParallelConfig (#3346)
* Add numNodes to ParallelConfig

If not provided, attempt to find the number of nodes by
adding the number of local ranks 0

Update device IDs check accordingly

Signed-off-by: Aurelien Chartier <achartier@nvidia.com>

* Add ParallelConfig pickle test

Signed-off-by: Aurelien Chartier <achartier@nvidia.com>

---------

Signed-off-by: Aurelien Chartier <achartier@nvidia.com>
2025-04-13 13:55:04 +02:00
..
batch_manager refactor: batch slot management in decoder classes (#3300) 2025-04-13 05:05:13 +08:00
common chore: Stabilize ABI boundary for internal kernel library (#3117) 2025-04-11 15:07:50 +08:00
cutlass_extensions/include/cutlass_extensions feat: Update cutlass (#2981) 2025-03-26 22:36:27 +08:00
executor feat: Add numNodes to ParallelConfig (#3346) 2025-04-13 13:55:04 +02:00
executor_worker Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00
kernels fix: Eagle decoding (#3456) 2025-04-11 22:06:38 +08:00
layers fix: Eagle decoding (#3456) 2025-04-11 22:06:38 +08:00
plugins Feat: Variable-Beam-Width-Search (VBWS) part3 (#3338) 2025-04-08 23:51:27 +08:00
pybind feat: Add numNodes to ParallelConfig (#3346) 2025-04-13 13:55:04 +02:00
runtime feat: Add numNodes to ParallelConfig (#3346) 2025-04-13 13:55:04 +02:00
thop feat: Add NVFP4 UB pattern optimization pass in torch compile (#3371) 2025-04-11 21:25:29 +08:00
CMakeLists.txt chore: Stabilize ABI boundary for internal kernel library (#3117) 2025-04-11 15:07:50 +08:00