TensorRT-LLMs/tests/integration/defs/accuracy/references
Wanli Jiang d100599ea7
[TRTLLM-9264][fix] Add accuracy/unit tests/doc for phi4mm (#9246)
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
2025-11-26 11:12:35 +08:00
..
cnn_dailymail.yaml [None][feat] Optimize MLA kernels with separate reduction kernels (#7597) 2025-09-09 16:58:44 +08:00
gpqa_diamond.yaml [None][test] Enhance GPT-OSS CI with GPQA Diamond and additional Spec Decoding Test (#8661) 2025-11-02 16:44:02 -08:00
gsm8k.yaml [TRTLLM-7967][feat] Adding Starcoder2 PyTorch Backend Support (#8923) 2025-11-24 11:23:22 -08:00
humaneval.yaml Update (#2978) 2025-03-23 16:39:35 +08:00
json_mode_eval.yaml [TRTLLM-7028][feat] Enable guided decoding with speculative decoding (part 2: one-model engine) (#6948) 2025-09-03 15:16:11 -07:00
longbench_v2.yaml [TRTLLM-8948][test] Add long bench case (#9165) 2025-11-18 04:41:48 -08:00
mmlu.yaml [https://nvbugs/5680905][fix] Relax the MMLU accuracy requirement for DS-v3.2 (#9439) 2025-11-26 00:32:20 +08:00
mmmu.yaml [TRTLLM-9264][fix] Add accuracy/unit tests/doc for phi4mm (#9246) 2025-11-26 11:12:35 +08:00
passkey_retrieval_64k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
passkey_retrieval_128k.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
SlimPajama-6B.yaml test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982) 2025-03-25 07:34:10 +08:00
zero_scrolls.yaml Update (#2978) 2025-03-23 16:39:35 +08:00