TensorRT-LLMs/tensorrt_llm/auto_parallel
Kaiyu Xie bf0a5afc92
Update TensorRT-LLM (#1598)
* Update TensorRT-LLM
2024-05-14 16:43:41 +08:00
..
tensor_parallel Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
__init__.py Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
auto_parallel.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
cluster_info.py Update TensorRT-LLM (#1492) 2024-04-24 14:44:22 +08:00
config.py Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
device_mesh.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
node_graph.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
parallelization.py Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
pipeline_graph.py Update TensorRT-LLM (#1427) 2024-04-09 17:03:34 +08:00
runtime_profiling.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
shape_info.py Update TensorRT-LLM (#1427) 2024-04-09 17:03:34 +08:00
simplifier.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
solver.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
utils.py Update TensorRT-LLM (#1492) 2024-04-24 14:44:22 +08:00