TensorRT-LLMs/examples/llm-api/_tensorrt_engine
2025-12-07 07:14:05 -08:00
File                        Last updated                Last commit
llm_eagle2_decoding.py      2025-07-10 11:37:30 -04:00  [refactor] Simplification of Speculative decoding configs (#5639)
llm_eagle_decoding.py       2025-07-10 11:37:30 -04:00  [refactor] Simplification of Speculative decoding configs (#5639)
llm_inference_customize.py  2025-07-08 21:00:42 -07:00  chores: merge examples for v1.0 doc (#5736)
llm_inference_kv_events.py  2025-07-10 11:31:35 +08:00  [TRTLLM-5530] chore: rename LLM.autotuner_enabled to enable_autotuner (#5876)
llm_lookahead_decoding.py   2025-07-01 19:06:41 +08:00  [TRTLLM-5277] chore: refine llmapi examples for 1.0 (part1) (#5431)
llm_medusa_decoding.py      2025-12-07 07:14:05 -08:00  [OMNIML-3036][doc] Re-branding TensorRT-Model-Optimizer as Nvidia Model-Optimizer (#9679)
llm_quantization.py         2025-07-08 21:00:42 -07:00  chores: merge examples for v1.0 doc (#5736)
quickstart_example.py       2025-12-07 07:14:05 -08:00  [OMNIML-3036][doc] Re-branding TensorRT-Model-Optimizer as Nvidia Model-Optimizer (#9679)