TensorRT-LLMs/json_mode_eval.yaml at c9fe07ede649d3f9659728362af39883f5f502ce

kanshan/TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Enwei Zhu 1b9781e8e7

[TRTLLM-6409][feat] Enable guided decoding with speculative decoding (part 1: two-model engine) (#6300)

Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>

2025-08-07 05:53:48 -04:00

7 lines

144 B

YAML


meta-llama/Llama-3.1-8B-Instruct:
  - accuracy: 74.00
  - spec_dec_algo: Eagle
    accuracy: 74.00
  - spec_dec_algo: NGram
    accuracy: 74.00
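
The file maps a model name to a list of entries. Each entry carries an `accuracy` value and, optionally, a `spec_dec_algo` key. The sketch below shows one plausible way such a structure could be queried. It is an illustration only: `lookup_accuracy` is a hypothetical helper, not a function from TensorRT-LLM, and the matching rule (an entry without `spec_dec_algo` is treated as the baseline) is an assumption. The data is written as a Python literal mirroring the YAML so that no YAML parser is needed.

```python
# Parsed form of the YAML above, written out as a Python literal so the
# sketch does not depend on a third-party YAML parser.
REFERENCE = {
    "meta-llama/Llama-3.1-8B-Instruct": [
        {"accuracy": 74.00},
        {"spec_dec_algo": "Eagle", "accuracy": 74.00},
        {"spec_dec_algo": "NGram", "accuracy": 74.00},
    ]
}


def lookup_accuracy(reference, model, spec_dec_algo=None):
    """Return the reference accuracy for a model and optional spec_dec_algo.

    Hypothetical helper (assumption): an entry matches when its
    `spec_dec_algo` value equals the requested one, where a missing key
    counts as None, i.e. the baseline without speculative decoding.
    """
    for entry in reference[model]:
        if entry.get("spec_dec_algo") == spec_dec_algo:
            return entry["accuracy"]
    raise KeyError(
        f"no entry for {model!r} with spec_dec_algo={spec_dec_algo!r}"
    )


MODEL = "meta-llama/Llama-3.1-8B-Instruct"
baseline = lookup_accuracy(REFERENCE, MODEL)
eagle = lookup_accuracy(REFERENCE, MODEL, "Eagle")
ngram = lookup_accuracy(REFERENCE, MODEL, "NGram")
```

Under this reading, all three lookups return the same value, since the file lists the same reference accuracy with and without speculative decoding.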