TensorRT-LLMs/tests/integration/defs/accuracy/references
William Zhang ff7eb93f31
[https://nvbugs/5669097][tests] Add MMMU test for mistral small (#10530)
Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com>
2026-01-09 16:09:28 -08:00
..
cnn_dailymail.yaml [None][feat] Optimize MLA kernels with separate reduction kernels (#7597) 2025-09-09 16:58:44 +08:00
gpqa_diamond.yaml [None][test] Enhance GPT-OSS CI with GPQA Diamond and additional Spec Decoding Test (#8661) 2025-11-02 16:44:02 -08:00
gsm8k.yaml [https://nvbugs/5698434][test] add qwen3-4b accuracy test case (#10382) 2026-01-06 21:56:34 -05:00
humaneval.yaml Update (#2978) 2025-03-23 16:39:35 +08:00
json_mode_eval.yaml [TRTLLM-9896][test] add vswa test cases coverage (#10146) 2026-01-06 02:02:29 -05:00
longbench_v1.yaml [None][infra] Add LongBenchV1 to trtllm-eval. (#10265) 2025-12-30 21:39:34 +08:00
longbench_v2.yaml [None][chore] Move the rocketkv e2e test to post-merge (#9768) 2025-12-08 13:22:16 +08:00
mmlu.yaml [None][fix] Mistral large 3 few code refine (#10405) 2026-01-08 06:38:49 -05:00
mmmu.yaml [https://nvbugs/5669097][tests] Add MMMU test for mistral small (#10530) 2026-01-09 16:09:28 -08:00
passkey_retrieval_64k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
passkey_retrieval_128k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
SlimPajama-6B.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
zero_scrolls.yaml Update (#2978) 2025-03-23 16:39:35 +08:00