TensorRT-LLMs/tests/unittest/_torch/modeling
Omer Ullman Argov 1db63c2546
[fix] speedup modeling unittests (#5579)
Signed-off-by: Omer Ullman Argov <118735753+omera-nv@users.noreply.github.com>
2025-06-30 06:30:45 +03:00
..
test_modeling_bert.py feat: no-cache attention in PyTorch workflow (#3085) 2025-04-05 01:54:32 +08:00
test_modeling_clip.py feat: add Pytorch support of Vision Encoder for multimodal models (#3791) 2025-05-03 05:13:47 +08:00
test_modeling_deepseek.py [TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014) 2025-06-30 11:05:40 +08:00
test_modeling_llama_min_latency.py [fix] speedup modeling unittests (#5579) 2025-06-30 06:30:45 +03:00
test_modeling_llama.py [fix] speedup modeling unittests (#5579) 2025-06-30 06:30:45 +03:00
test_modeling_mixtral.py [fix] speedup modeling unittests (#5579) 2025-06-30 06:30:45 +03:00
test_modeling_mllama.py test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
test_modeling_nemotron_h.py [TRTLLM-5530][BREAKING CHANGE]: enhance the llm args pytorch config part 1(cuda_graph_config) (#5014) 2025-06-30 11:05:40 +08:00
test_modeling_nemotron_nas.py [fix][test] Speedup Nemotron NAS unittests (#5202) 2025-06-15 11:26:03 +03:00
test_modeling_nemotron.py [fix] speedup modeling unittests (#5579) 2025-06-30 06:30:45 +03:00
test_modeling_out_of_tree.py [TRTLLM-5208][BREAKING CHANGE] chore: make pytorch LLM the default (#5312) 2025-06-20 03:01:10 +08:00
test_modeling_qwen_moe.py feat: add qwen2 moe to torch flow; fix wrong imported KvCacheConfig in gpqa… (#3369) 2025-04-10 22:45:57 +08:00
test_modeling_qwen.py feat: no-cache attention in PyTorch workflow (#3085) 2025-04-05 01:54:32 +08:00
test_modeling_siglip.py feat: add Pytorch support of Vision Encoder for multimodal models (#3791) 2025-05-03 05:13:47 +08:00
test_modeling_vila.py feat: llama4 input processor (#3383) 2025-04-25 16:47:14 -07:00