TensorRT-LLMs/tests/integration/defs/accuracy/references/json_mode_eval.yaml
Enwei Zhu 1b9781e8e7
[TRTLLM-6409][feat] Enable guided decoding with speculative decoding (part 1: two-model engine) (#6300)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-08-07 05:53:48 -04:00

7 lines
144 B
YAML

meta-llama/Llama-3.1-8B-Instruct:
- accuracy: 74.00
- spec_dec_algo: Eagle
accuracy: 74.00
- spec_dec_algo: NGram
accuracy: 74.00