TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-28 06:33:15 +08:00

History

dongfengy 5a01f382c1 [https://nvbugs/5575913 ][fix] Use separate thresholds for 120b/20b gptoss (#8664 ) Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com> Signed-off-by: dongfengy <99041270+dongfengy@users.noreply.github.com>		2025-10-28 10:35:07 -04:00
..
cnn_dailymail.yaml	[None][feat] Optimize MLA kernels with separate reduction kernels (#7597 )	2025-09-09 16:58:44 +08:00
gpqa_diamond.yaml	[TRTLLM-8535][feat] Support DeepSeek V3.2 with FP8 + BF16 KV cache/NVFP4 + BF16 KV cache (#8405 )	2025-10-24 13:40:41 -04:00
gsm8k.yaml	[https://nvbugs/5575913 ][fix] Use separate thresholds for 120b/20b gptoss (#8664 )	2025-10-28 10:35:07 -04:00
humaneval.yaml	Update (#2978 )	2025-03-23 16:39:35 +08:00
json_mode_eval.yaml	[TRTLLM-7028][feat] Enable guided decoding with speculative decoding (part 2: one-model engine) (#6948 )	2025-09-03 15:16:11 -07:00
mmlu.yaml	[TRTLLM-8535][feat] Support DeepSeek V3.2 with FP8 + BF16 KV cache/NVFP4 + BF16 KV cache (#8405 )	2025-10-24 13:40:41 -04:00
mmmu.yaml	[TRTLLM-6577][feat] Support nano_v2_vlm in pytorch backend (#7207 )	2025-09-18 16:26:20 +08:00
passkey_retrieval_64k.yaml	test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982 )	2025-03-25 07:34:10 +08:00
passkey_retrieval_128k.yaml	test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982 )	2025-03-25 07:34:10 +08:00
SlimPajama-6B.yaml	test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982 )	2025-03-25 07:34:10 +08:00
zero_scrolls.yaml	Update (#2978 )	2025-03-23 16:39:35 +08:00