TensorRT-LLMs/_sources/key-features.md.txt
2024-08-30 13:09:14 +08:00

11 lines
408 B
Plaintext

# Key Features
This document lists key features supported in TensorRT-LLM.
- [Quantization](../source/reference/precision.md)
- [Inflight Batching](../source/advanced/gpt-attention.md#in-flight-batching)
- [Chunked Context](../source/advanced/gpt-attention.md#chunked-context)
- [LoRA](../source/advanced/lora.md)
- [KV Cache Reuse](./kv_cache_reuse.md)
- [Speculative Sampling](./speculative_decoding.md)