TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Simeng Liu 84d107b2f0 [https://nvbugs/5717993][fix] Add execution_stream across PyExecutor, KVCacheManager, PeftCacheManager to ensure proper CUDA stream synchronization between KV cache transfer operations and model forward kernels. (#10060) Signed-off-by: SimengLiu-nv <simengl@nvidia.com>		2025-12-31 09:22:54 -08:00
lm_eval_tasks/gpqa/cot_zeroshot_aa	test [TRTLLM-4477,TRTLLM-4481]: Accuracy test improvement (Part 3.5): Support GSM8K and GPQA (#3483)	2025-04-22 07:38:16 +08:00
__init__.py	[None][infra] Add LongBenchV1 to trtllm-eval. (#10265)	2025-12-30 21:39:34 +08:00
cnn_dailymail.py	[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312)	2025-06-20 03:01:10 +08:00
interface.py	[None][test] Add post merge test for Seed-OSS-36B-Instruct (#8321)	2025-10-17 02:30:33 -07:00
json_mode_eval.py	[TRTLLM-8269][test] do not explicitly pass temperature=0 to select greedy sampling (#8110)	2025-10-02 10:20:32 +02:00
lm_eval.py	[https://nvbugs/5717993][fix] Add execution_stream across PyExecutor, KVCacheManager, PeftCacheManager to ensure proper CUDA stream synchronization between KV cache transfer operations and model forward kernels. (#10060)	2025-12-31 09:22:54 -08:00
longbench_v2.py	[TRTLLM-9805][feat] Skip Softmax Attention. (#9821)	2025-12-21 02:52:42 -05:00
mmlu.py	[https://nvbugs/4141427][chore] Add more details to LICENSE file (#9881)	2025-12-13 08:35:31 +08:00