TensorRT-LLMs/tests/integration/defs/accuracy/references
brb-nv 201fd257cc
[https://nvbugs/5478151][fix] Add missing spec for Llama-3.3 70B (#7267)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>
2025-08-27 09:56:58 +08:00
..
cnn_dailymail.yaml [https://nvbugs/5448525][fix] Mistral Small 3.1 accuracy tests (#6909) 2025-08-18 11:17:37 +08:00
gpqa_diamond.yaml [https://nvbugs/5478151][fix] Add missing spec for Llama-3.3 70B (#7267) 2025-08-27 09:56:58 +08:00
gsm8k.yaml [https://nvbugs/5440241][fix] Fix 70B GSM8K Accuracy drop (#7075) 2025-08-20 18:11:00 -04:00
humaneval.yaml Update (#2978) 2025-03-23 16:39:35 +08:00
json_mode_eval.yaml test: Add json_mode_eval for guided decoding evaluation (#5179) 2025-06-16 10:03:55 +08:00
mmlu.yaml [https://nvbugs/5440241][fix] Fix 70B GSM8K Accuracy drop (#7075) 2025-08-20 18:11:00 -04:00
passkey_retrieval_64k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
passkey_retrieval_128k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
SlimPajama-6B.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
zero_scrolls.yaml Update (#2978) 2025-03-23 16:39:35 +08:00