TensorRT-LLMs/triton_backend
Emma Qiao ff32caf4d7
[Infra] - Update dependencies with NGC PyTorch 25.05 and TRT 10.11 (#4885)
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Signed-off-by: Emma Qiao <qqiao@nvidia.com>
Co-authored-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
Co-authored-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
2025-06-17 23:48:34 +08:00
..
all_models feat: add multi-node support for Triton with pytorch backend (#5172) 2025-06-13 13:27:58 -07:00
ci Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
inflight_batcher_llm [Infra] - Update dependencies with NGC PyTorch 25.05 and TRT 10.11 (#4885) 2025-06-17 23:48:34 +08:00
scripts feat: add multi-node support for Triton with pytorch backend (#5172) 2025-06-13 13:27:58 -07:00
tools [nvbug 5283506] fix: Fix spec decode triton test (#4845) 2025-06-09 08:40:17 -04:00
requirements.txt Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00