TensorRT-LLMs/tensorrt_llm/evaluate
2025-12-31 09:22:54 -08:00
..
lm_eval_tasks/gpqa/cot_zeroshot_aa
__init__.py
cnn_dailymail.py
interface.py
json_mode_eval.py
lm_eval.py [https://nvbugs/5717993][fix] Add execution_stream across PyExecutor, KVCacheManager, PeftCacheManager to ensure proper CUDA stream synchronization between KV cache transfer operations and model forward kernels. (#10060) 2025-12-31 09:22:54 -08:00
longbench_v2.py
mmlu.py