TensorRT-LLMs/tests/integration/defs/accuracy/references
xinhe-nv 5f939b9121
[None][chore] Add failed cases into waives.txt (#7342)
Signed-off-by: Xin He (SW-GPU) <200704525+xinhe-nv@users.noreply.github.com>
2025-08-30 00:49:14 -04:00
..
cnn_dailymail.yaml [None][chore] Add failed cases into waives.txt (#7342) 2025-08-30 00:49:14 -04:00
gpqa_diamond.yaml [https://nvbugs/5409414][fix] fix Not registered specs (#6660) 2025-08-07 17:55:53 +10:00
gsm8k.yaml [None][chore] Update pre-merge test to add DeepSeek/LLaMA and gpt-oss (#7192) 2025-08-29 17:03:46 +08:00
humaneval.yaml Update (#2978) 2025-03-23 16:39:35 +08:00
json_mode_eval.yaml [TRTLLM-6409][feat] Enable guided decoding with speculative decoding (part 1: two-model engine) (#6300) 2025-08-07 05:53:48 -04:00
mmlu.yaml [None][chore] Update pre-merge test to add DeepSeek/LLaMA and gpt-oss (#7192) 2025-08-29 17:03:46 +08:00
mmmu.yaml [TRTLLM-6771][feat] Support MMMU for multimodal models (#6828) 2025-08-21 08:54:12 +08:00
passkey_retrieval_64k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
passkey_retrieval_128k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
SlimPajama-6B.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
zero_scrolls.yaml Update (#2978) 2025-03-23 16:39:35 +08:00