TensorRT-LLMs/docs/source
Grzegorz Kwasniewski cff54fcae3
[#8948][feat] Support custom sharding config (#9143)
Signed-off-by: greg-kwasniewski1 <213329731+greg-kwasniewski1@users.noreply.github.com>
2025-11-29 05:28:05 +08:00
..
_static [None][doc] Enhance api reference doc by labeling stable APIs (#7751) 2025-09-22 14:28:38 +08:00
_templates Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
blogs [None][doc] Paragraph adjustment and fix statistic (#8568) 2025-10-22 03:26:09 -04:00
commands [TRTLLM-9085][doc] fix math formula rendering issues (#9481) 2025-11-27 10:09:12 +08:00
deployment-guide [TRTLLM-9513][docs] Qwen3 deployment guide (#9488) 2025-11-27 14:12:35 +08:00
developer-guide [TRTLLM-9179][feat] add pp_partition to customize each rank's layer number (#9003) 2025-11-13 10:34:17 +08:00
examples [None][doc] Rename TensorRT-LLM to TensorRT LLM. (#7554) 2025-09-09 12:16:03 +08:00
features [#9150][feat] AutoDeploy Nemotron-Flash support (#9504) 2025-11-27 18:03:57 +01:00
installation [None][chore] Lock onnx version <1.20.0 and remove WAR for TRT 10.13 (#9006) 2025-11-10 10:34:06 +08:00
legacy [TRTLLM-8994][infra] upgrade to DLFW 25.10 and pytorch 2.9.0 / triton 3.5.0 (#8838) 2025-11-04 18:59:34 +08:00
llm-api [https://nvbugs/5416501][doc] add known issues to llmapi doc (#7560) 2025-09-22 14:28:38 +08:00
media [None][doc] Add doc for torch.compile & piecewise cuda graph (#8527) 2025-10-29 21:15:46 -07:00
models [None][feat] Support kv_cahce_reuse for HyperCLOVAX-Vision model (#7789) 2025-10-21 11:11:24 +09:00
torch [#8948][feat] Support custom sharding config (#9143) 2025-11-29 05:28:05 +08:00
conf.py [None][doc] Update docker cmd in quick start guide and trtllm-serve … (#7787) 2025-09-25 21:02:35 +08:00
helper.py [TRTLLM-8680][doc] Add table with one-line deployment commands to docs (#8173) 2025-11-03 17:42:41 -08:00
index.rst [None][doc] Add doc for torch.compile & piecewise cuda graph (#8527) 2025-10-29 21:15:46 -07:00
overview.md [None][doc] Add the missing content for model support section and fix valid links for long_sequence.md (#8869) 2025-11-03 02:06:04 -08:00
quick-start-guide.md [TRTLLM-8680][doc] Add table with one-line deployment commands to docs (#8173) 2025-11-03 17:42:41 -08:00
release-notes.md [None][doc] add Llama PP known issue to release note (#7959) 2025-09-25 21:02:35 +08:00