TensorRT-LLMs/tensorrt_llm/auto_parallel
Kaiyu Xie aaacc9bd68
Update TensorRT-LLM (#2562)
* Update TensorRT-LLM

---------

Co-authored-by: Starrick Liu <73152103+StarrickLiu@users.noreply.github.com>
2024-12-11 00:31:05 -08:00
..
tensor_parallel Update TensorRT-LLM (#2562) 2024-12-11 00:31:05 -08:00
__init__.py Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
auto_parallel.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cluster_info.py Update TensorRT-LLM (#2562) 2024-12-11 00:31:05 -08:00
config.py Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
device_mesh.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
node_graph.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
parallelization.py Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
pipeline_graph.py Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
runtime_profiling.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
shape_info.py Update TensorRT-LLM (#1688) 2024-05-28 20:07:49 +08:00
simplifier.py Update TensorRT-LLM (#1639) 2024-05-21 17:51:02 +08:00
solver.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
utils.py Update TensorRT-LLM (#1688) 2024-05-28 20:07:49 +08:00