TensorRT-LLMs/tests/integration/defs/accuracy/references
brb-nv 43f6ad7813
[https://nvbugs/5708475][fix] Fix e2e eval accuracy for helix parallelism (#9647)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2025-12-03 15:13:59 +08:00
..
cnn_dailymail.yaml [None][feat] Optimize MLA kernels with separate reduction kernels (#7597) 2025-09-09 16:58:44 +08:00
gpqa_diamond.yaml [None][test] Enhance GPT-OSS CI with GPQA Diamond and additional Spec Decoding Test (#8661) 2025-11-02 16:44:02 -08:00
gsm8k.yaml [https://nvbugs/5708475][fix] Fix e2e eval accuracy for helix parallelism (#9647) 2025-12-03 15:13:59 +08:00
humaneval.yaml Update (#2978) 2025-03-23 16:39:35 +08:00
json_mode_eval.yaml [TRTLLM-7028][feat] Enable guided decoding with speculative decoding (part 2: one-model engine) (#6948) 2025-09-03 15:16:11 -07:00
longbench_v2.yaml [None][feat] Add RocketKV usage doc and e2e accuracy test on LongBenchV2 (#9572) 2025-12-03 11:33:46 +08:00
mmlu.yaml [None][feat] add qwen3-next CI test of accuracy on BF16 and NVFP4 (#9330) 2025-11-27 18:05:00 +08:00
mmmu.yaml [TRTLLM-9264][fix] Add accuracy/unit tests/doc for phi4mm (#9246) 2025-11-26 11:12:35 +08:00
passkey_retrieval_64k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
passkey_retrieval_128k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
SlimPajama-6B.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
zero_scrolls.yaml Update (#2978) 2025-03-23 16:39:35 +08:00