TensorRT-LLMs/docs/source/legacy/key-features.md
Guoming Zhang 085271eceb
[None][doc] Clean the doc folder and move the outdated docs into lega… (#7729)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-09-16 11:43:19 +08:00

11 lines
442 B
Markdown

# Key Features
This document lists key features supported in TensorRT-LLM.
- [Quantization](../source/reference/precision.md)
- [Inflight Batching](../source/advanced/gpt-attention.md#in-flight-batching)
- [Chunked Context](../source/advanced/gpt-attention.md#chunked-context)
- [LoRA](../source/advanced/lora.md)
- [KV Cache Reuse](../source/advanced/kv-cache-reuse.md)
- [Speculative Sampling](../source/advanced/speculative-decoding.md)