TensorRT-LLMs/triton_backend
Xiwen Yu 52ad4436bc disable 3xfp4
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-08-06 14:25:05 +08:00
..
all_models chore: delete useless gitkeep files. (#6400) 2025-07-28 11:38:30 -04:00
ci chore: Port leftover 0.20 (#5907) 2025-07-22 12:48:00 +08:00
inflight_batcher_llm disable 3xfp4 2025-08-06 14:25:05 +08:00
scripts [nvbug/5308432] fix: extend triton exit time for test_llava (#5971) 2025-07-12 12:56:37 +09:00
tools feat: Add support for Triton request cancellation (#5898) 2025-07-15 20:52:43 -04:00
requirements.txt Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00