TensorRT-LLMs/tensorrt_llm/auto_parallel
2024-03-19 17:36:42 +08:00
..
tensor_parallel Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
__init__.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
auto_parallel.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
config.py Update TensorRT-LLM (#1315) 2024-03-19 17:36:42 +08:00
device_mesh.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
node_graph.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
parallelization.py Update TensorRT-LLM (#1315) 2024-03-19 17:36:42 +08:00
pipeline_graph.py Update TensorRT-LLM (#1315) 2024-03-19 17:36:42 +08:00
runtime_profiling.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
shape_info.py Update TensorRT-LLM (#1315) 2024-03-19 17:36:42 +08:00
simplifier.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
solver.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
utils.py Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00