TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

William Zhang d15dcdc4ae [https://nvbugs/5448525][fix] Mistral Small 3.1 accuracy tests (#6909) This commit lowers the GPU memory allocated for KV cache in accuracy tests, and adjusts a threshold for Mistral Small 3.1 24B for FP8. Signed-off-by: William Zhang <133824995+2ez4bz@users.noreply.github.com> Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>		2025-09-01 11:02:31 +08:00
cnn_dailymail.yaml	[https://nvbugs/5448525][fix] Mistral Small 3.1 accuracy tests (#6909)	2025-09-01 11:02:31 +08:00
gpqa_diamond.yaml	[https://nvbugs/5409414][fix] fix Not registered specs (#6660)	2025-08-07 17:55:53 +10:00
gsm8k.yaml	[TRTLLM-5252][feat] Add fp8 support for Mistral Small 3.1 (#6731)	2025-09-01 11:02:31 +08:00
humaneval.yaml	Update (#2978)	2025-03-23 16:39:35 +08:00
json_mode_eval.yaml	[TRTLLM-6409][feat] Enable guided decoding with speculative decoding (part 1: two-model engine) (#6300)	2025-08-07 05:53:48 -04:00
mmlu.yaml	[TRTLLM-5252][feat] Add fp8 support for Mistral Small 3.1 (#6731)	2025-09-01 11:02:31 +08:00
mmmu.yaml	[TRTLLM-6771][feat] Support MMMU for multimodal models (#6828)	2025-08-21 08:54:12 +08:00
passkey_retrieval_64k.yaml	test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982)	2025-03-25 07:34:10 +08:00
passkey_retrieval_128k.yaml	test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982)	2025-03-25 07:34:10 +08:00
SlimPajama-6B.yaml	test: Accuracy test improvement (Part 2): Incorporate mmlu to accuracy test suite (#2982)	2025-03-25 07:34:10 +08:00
zero_scrolls.yaml	Update (#2978)	2025-03-23 16:39:35 +08:00