TensorRT-LLMs/examples/configs/curated
Jatin Gangani 97b38ac403
[None] [doc] Update IFB performance guide & GPTOSS deployment guide (#10283)
Signed-off-by: Jatin Gangani <jgangani@dc2-container-xterm-014.prd.it.nvidia.com>
Co-authored-by: Jatin Gangani <jgangani@dc2-container-xterm-014.prd.it.nvidia.com>
2025-12-25 05:52:04 -05:00
File                          Last commit                                                                   Date
deepseek-r1-deepgemm.yaml     [TRTC-43] [feat] Add config db and docs (#9420)                               2025-12-12 04:00:03 +08:00
deepseek-r1-latency.yaml      [TRTC-43] [feat] Add config db and docs (#9420)                               2025-12-12 04:00:03 +08:00
deepseek-r1-throughput.yaml   [TRTC-43] [feat] Add config db and docs (#9420)                               2025-12-12 04:00:03 +08:00
gpt-oss-120b-latency.yaml     [None] [doc] Update IFB performance guide & GPTOSS deployment guide (#10283)  2025-12-25 05:52:04 -05:00
gpt-oss-120b-throughput.yaml  [None] [doc] Update IFB performance guide & GPTOSS deployment guide (#10283)  2025-12-25 05:52:04 -05:00
llama-3.3-70b.yaml            [TRTC-43] [feat] Add config db and docs (#9420)                               2025-12-12 04:00:03 +08:00
llama-4-scout.yaml            [TRTC-43] [feat] Add config db and docs (#9420)                               2025-12-12 04:00:03 +08:00
qwen3-disagg-prefill.yaml     [TRTC-43] [feat] Add config db and docs (#9420)                               2025-12-12 04:00:03 +08:00
qwen3-next.yaml               [None][fix] enable KV cache reuse for config database (#10094)                2025-12-19 15:16:56 -08:00
qwen3.yaml                    [TRTC-43] [feat] Add config db and docs (#9420)                               2025-12-12 04:00:03 +08:00