TensorRT-LLMs/tests/integration/defs/accuracy/references/json_mode_eval.yaml
Ivy Zhang 1e828587e5
[TRTLLM-9896][test] add vswa test cases coverage (#10146)
Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>
2026-01-06 02:02:29 -05:00

22 lines
499 B
YAML

meta-llama/Llama-3.1-8B-Instruct:
- accuracy: 74.00
- spec_dec_algo: Eagle
accuracy: 74.00
- spec_dec_algo: NGram
accuracy: 74.00
deepseek-ai/DeepSeek-V3-Lite:
- accuracy: 77.00
- spec_dec_algo: MTP
accuracy: 77.00
google/gemma-3-1b-it:
- quant_algo: FP8
kv_cache_quant_algo: FP8
accuracy: 61.00
GPT-OSS/120B-MXFP4:
- quant_algo: W4A16_MXFP4
spec_dec_algo: Eagle
accuracy: 62.00
- quant_algo: W4A8_MXFP4_MXFP8
spec_dec_algo: Eagle
accuracy: 62.00