TensorRT-LLMs/triton_backend
amirkl94 8429c8b139
chore: Port leftover 0.20 (#5907)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Yingge He <yinggeh@nvidia.com>
Signed-off-by: Martin Marciniszyn Mehringer <11665257+MartinMarciniszyn@users.noreply.github.com>
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Co-authored-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Co-authored-by: Yingge He <157551214+yinggeh@users.noreply.github.com>
Co-authored-by: Martin Marciniszyn Mehringer <11665257+MartinMarciniszyn@users.noreply.github.com>
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
Co-authored-by: zpatel <22306219+zbpatel@users.noreply.github.com>
2025-07-10 13:48:12 +02:00
..
all_models fix: only set _mpi_session if world_size is > 1 (#5253) 2025-06-17 19:21:41 -07:00
ci chore: Port leftover 0.20 (#5907) 2025-07-10 13:48:12 +02:00
inflight_batcher_llm [Infra] - Update dependencies with NGC PyTorch 25.05 and TRT 10.11 (#4885) 2025-06-17 23:48:34 +08:00
scripts feat: add multi-node support for Triton with pytorch backend (#5172) 2025-06-13 13:27:58 -07:00
tools [nvbug 5283506] fix: Fix spec decode triton test (#4845) 2025-06-09 08:40:17 -04:00
requirements.txt Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00