mirror of https://github.com/vllm-project/vllm.git synced 2026-06-06 00:16:14 +00:00

Using vLLM

First, vLLM must be installed for your chosen device in either a Python or Docker environment.
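As a quick sanity check before moving on, the sketch below reports whether vLLM is visible to the current Python environment. It assumes the package was installed under the distribution name `vllm`, and the helper name `installed_version` is hypothetical, introduced here only for illustration. It uses only the standard library and does not import vLLM itself, so it does not confirm that your device backend works.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str = "vllm") -> Optional[str]:
    """Return the installed version of `package`, or None if it is absent.

    Assumption: vLLM is installed under the distribution name "vllm".
    """
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("vLLM is not installed in this Python environment.")
    else:
        print(f"vLLM {found} is installed.")
```

In a Docker setup, the same check would need to run inside the container rather than on the host.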

Once installed, vLLM supports the following usage patterns: