TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-26 21:53:30 +08:00

History

Omer Ullman Argov 1db63c2546 [fix] speedup modeling unittests (#5579 ) Signed-off-by: Omer Ullman Argov <118735753+omera-nv@users.noreply.github.com>		2025-06-30 06:30:45 +03:00
..
test_modeling_bert.py	feat: no-cache attention in PyTorch workflow (#3085 )	2025-04-05 01:54:32 +08:00
test_modeling_clip.py	feat: add Pytorch support of Vision Encoder for multimodal models (#3791 )	2025-05-03 05:13:47 +08:00
test_modeling_deepseek.py	[TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014 )	2025-06-30 11:05:40 +08:00
test_modeling_llama_min_latency.py	[fix] speedup modeling unittests (#5579 )	2025-06-30 06:30:45 +03:00
test_modeling_llama.py	[fix] speedup modeling unittests (#5579 )	2025-06-30 06:30:45 +03:00
test_modeling_mixtral.py	[fix] speedup modeling unittests (#5579 )	2025-06-30 06:30:45 +03:00
test_modeling_mllama.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
test_modeling_nemotron_h.py	[TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014 )	2025-06-30 11:05:40 +08:00
test_modeling_nemotron_nas.py	[fix][test] Speedup Nemotron NAS unittests (#5202 )	2025-06-15 11:26:03 +03:00
test_modeling_nemotron.py	[fix] speedup modeling unittests (#5579 )	2025-06-30 06:30:45 +03:00
test_modeling_out_of_tree.py	[TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312 )	2025-06-20 03:01:10 +08:00
test_modeling_qwen_moe.py	feat: add qwen2 moe to torch flow; fix wrong imported KvCacheConfig in gpqa… (#3369 )	2025-04-10 22:45:57 +08:00
test_modeling_qwen.py	feat: no-cache attention in PyTorch workflow (#3085 )	2025-04-05 01:54:32 +08:00
test_modeling_siglip.py	feat: add Pytorch support of Vision Encoder for multimodal models (#3791 )	2025-05-03 05:13:47 +08:00
test_modeling_vila.py	feat: llama4 input processor (#3383 )	2025-04-25 16:47:14 -07:00