TensorRT-LLMs/tensorrt_llm/evaluate
heyuhhh dfac07c045
[None][feat] Support to export data in trtllm-eval (#10075)
Signed-off-by: yuhangh <58161490+heyuhhh@users.noreply.github.com>
2026-01-15 23:27:08 +08:00
..
lm_eval_tasks/gpqa/cot_zeroshot_aa test [TRTLLM-4477,TRTLLM-4481]: Accuracy test improvement (Part 3.5): Support GSM8K and GPQA (#3483) 2025-04-22 07:38:16 +08:00
__init__.py [None][infra] Add LongBenchV1 to trtllm-eval. (#10265) 2025-12-30 21:39:34 +08:00
cnn_dailymail.py [None][feat] Support to export data in trtllm-eval (#10075) 2026-01-15 23:27:08 +08:00
interface.py [None][feat] Support to export data in trtllm-eval (#10075) 2026-01-15 23:27:08 +08:00
json_mode_eval.py [None][feat] Support to export data in trtllm-eval (#10075) 2026-01-15 23:27:08 +08:00
lm_eval.py [None][feat] Support to export data in trtllm-eval (#10075) 2026-01-15 23:27:08 +08:00
longbench_v2.py [None][feat] Support to export data in trtllm-eval (#10075) 2026-01-15 23:27:08 +08:00
mmlu.py [None][feat] Support to export data in trtllm-eval (#10075) 2026-01-15 23:27:08 +08:00