TensorRT-LLMs/tensorrt_llm/auto_parallel
amirkl94 fbec0c3552
Release 0.20 to main (#4577)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
Signed-off-by: Martin Marciniszyn Mehringer <11665257+MartinMarciniszyn@users.noreply.github.com>
Signed-off-by: Yuan Tong <13075180+tongyuantongyu@users.noreply.github.com>
Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
Signed-off-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
Signed-off-by: Venky <23023424+venkywonka@users.noreply.github.com>
Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com>
Signed-off-by: Stefan Niebler <82932102+stnie@users.noreply.github.com>
Signed-off-by: Simeng Liu <simengl@nvidia.com>
Signed-off-by: Faraz Khoubsirat <58580514+farazkh80@users.noreply.github.com>
Signed-off-by: moraxu <mguzek@nvidia.com>
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
Signed-off-by: Jinyang Yuan <154768711+jinyangyuan-nvidia@users.noreply.github.com>
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Co-authored-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
Co-authored-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
Co-authored-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
Co-authored-by: Martin Marciniszyn Mehringer <11665257+MartinMarciniszyn@users.noreply.github.com>
Co-authored-by: Yuan Tong <13075180+tongyuantongyu@users.noreply.github.com>
Co-authored-by: Yukun He <23156053+hyukn@users.noreply.github.com>
Co-authored-by: Yuxian Qiu <142763828+yuxianq@users.noreply.github.com>
Co-authored-by: Venky <23023424+venkywonka@users.noreply.github.com>
Co-authored-by: ruodil <200874449+ruodil@users.noreply.github.com>
Co-authored-by: stnie <82932102+stnie@users.noreply.github.com>
Co-authored-by: Simeng Liu <109828133+SimengLiu-nv@users.noreply.github.com>
Co-authored-by: Faraz <58580514+farazkh80@users.noreply.github.com>
Co-authored-by: Michal Guzek <moraxu@users.noreply.github.com>
Co-authored-by: Iman Tabrizian <10105175+Tabrizian@users.noreply.github.com>
Co-authored-by: Jinyang Yuan <154768711+jinyangyuan-nvidia@users.noreply.github.com>
2025-05-28 16:25:33 +08:00
..
tensor_parallel chore: remove usernames from comments (#3291) 2025-04-05 13:44:28 +08:00
__init__.py Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
auto_parallel.py Release 0.20 to main (#4577) 2025-05-28 16:25:33 +08:00
cluster_info.py fix: Fix NVLink version decoding. (#3996) 2025-05-06 13:56:50 +08:00
config.py Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
device_mesh.py chore: remove usernames from comments (#3291) 2025-04-05 13:44:28 +08:00
node_graph.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
parallelization.py Update TensorRT-LLM (#2582) 2024-12-16 21:50:47 -08:00
pipeline_graph.py Update TensorRT-LLM (#2582) 2024-12-16 21:50:47 -08:00
runtime_profiling.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
shape_info.py Update TensorRT-LLM (#1688) 2024-05-28 20:07:49 +08:00
simplifier.py Update TensorRT-LLM (#1639) 2024-05-21 17:51:02 +08:00
solver.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
utils.py Update TensorRT-LLM (#1688) 2024-05-28 20:07:49 +08:00