TensorRT-LLMs/tensorrt_llm/auto_parallel
2024-11-05 10:17:16 +00:00
..
tensor_parallel TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
__init__.py TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
auto_parallel.py Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
cluster_info.py open source v0.12-jetson 2024-11-05 10:17:16 +00:00
config.py TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
device_mesh.py Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
node_graph.py Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
parallelization.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
pipeline_graph.py TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
runtime_profiling.py Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
shape_info.py TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
simplifier.py TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00
solver.py Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
utils.py TensorRT-LLM v0.11 Update (#1969) 2024-07-17 20:45:02 +08:00