TensorRT-LLMs/tests/integration/defs
ruodil 3d9a2b5eb7
test: remove enable_overlap_schedule in pytorch config and set enable_chunked prefill to be true for isl>2048 cases (#4285)
1.remove enable_overlap_schedule in pytorch config
2.rename model_yaml_config.py to pytorch_model_config.py and set enable_chunked_prefill to be true for cases with isl>2048

Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com>
Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>
2025-05-21 14:26:56 +08:00
..
accuracy Chore: waive torch compile test cases of deepseek v3 lite (#4508) 2025-05-21 10:43:31 +08:00
cpp test: NIXL single process test (#4486) 2025-05-21 10:41:46 +08:00
deterministic chore: Cleanup deprecated APIs from LLM-API (part 1/2) (#3732) 2025-05-07 13:20:25 +08:00
disaggregated unwaive some disagg test (#4476) 2025-05-21 11:45:11 +08:00
examples test: FIX test_ptp_quickstart_advanced_deepseek_v3_2nodes_8gpus (#4283) 2025-05-15 15:57:44 +08:00
llmapi chore: refactor llmapi e2e tests (#3803) 2025-05-05 07:37:24 +08:00
perf test: remove enable_overlap_schedule in pytorch config and set enable_chunked prefill to be true for isl>2048 cases (#4285) 2025-05-21 14:26:56 +08:00
stress_test Breaking change: perf: Enable scheduling overlap by default (#4174) 2025-05-15 14:27:36 +08:00
sysinfo Update (#2978) 2025-03-23 16:39:35 +08:00
triton_server Remove vila test (#4376) 2025-05-19 09:02:39 +08:00
__init__.py Update (#2978) 2025-03-23 16:39:35 +08:00
.test_durations [https://nvbugs/5123103][fix] Fix torch compile for DeepSeekV3 (#3952) 2025-05-19 22:12:25 +08:00
agg_unit_mem_df.csv test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
ci_profiler.py Update (#2978) 2025-03-23 16:39:35 +08:00
common.py Add test case for kv memory estimation (#4158) 2025-05-14 18:39:25 +08:00
conftest.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
local_venv.py tests: https://nvbugs/5219534 remove failed tests from test list (#4113) 2025-05-12 14:13:40 +08:00
pytest.ini Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
runner_interface.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_cache.py chore: clean some ci of qa test (#3083) 2025-03-31 14:30:41 +08:00
test_cases.yml Update (#2978) 2025-03-23 16:39:35 +08:00
test_e2e.py tests: add llama 3.3 70b 2 nodes tests (#4391) 2025-05-21 12:42:45 +08:00
test_list_parser.py infra: Add test list name check (#3097) 2025-04-20 23:02:16 +08:00
test_list_validation.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_mlpf_results.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_sanity.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_unittests.py [TRTLLM-4886][infra]Try another timeout opt to exit test thread directly instead of gracefully (#4341) 2025-05-16 17:56:40 +08:00
trt_test_alternative.py Add test case for kv memory estimation (#4158) 2025-05-14 18:39:25 +08:00