mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-02-06 03:01:50 +08:00
[https://nvbugs/5776445][chore] unwaive test (#10667)
Signed-off-by: Lizhi Zhou <1432185+reasonsolo@users.noreply.github.com>
This commit is contained in:
parent
066fa4cd93
commit
c6320d924d
@ -120,6 +120,7 @@ l0_dgx_h100:
|
||||
- accuracy/test_llm_api_pytorch.py::TestQwen3NextInstruct::test_bf16_4gpu[tp4ep4_cudagraph_overlap]
|
||||
- disaggregated/test_auto_scaling.py::test_service_discovery[etcd-round_robin]
|
||||
- disaggregated/test_auto_scaling.py::test_worker_restart[etcd-load_balancing]
|
||||
- disaggregated/test_auto_scaling.py::test_worker_restart[etcd-round_robin]
|
||||
- disaggregated/test_auto_scaling.py::test_minimal_instances[etcd-round_robin]
|
||||
- disaggregated/test_auto_scaling.py::test_disagg_server_restart[etcd-round_robin]
|
||||
- disaggregated/test_auto_scaling.py::test_service_discovery[http-round_robin]
|
||||
|
||||
@ -311,7 +311,6 @@ accuracy/test_disaggregated_serving.py::TestLlama3_1_8BInstruct::test_ctx_pp_gen
|
||||
accuracy/test_llm_api_pytorch.py::TestDeepSeekV3Lite::test_bfloat16_4gpus[ep4-mtp_nextn=0-attention_dp=False-cuda_graph=False-overlap_scheduler=False-torch_compile=False] SKIP (https://nvbugs/5701445)
|
||||
accuracy/test_disaggregated_serving.py::TestDeepSeekV3Lite::test_guided_decoding[llguidance-mtp_nextn=0] SKIP (https://nvbugs/5748600)
|
||||
unittest/_torch/ray_orchestrator/multi_gpu/test_multi_instance.py::test_multi_instance[tp2_2instances] SKIP (https://nvbugs/5784566)
|
||||
disaggregated/test_auto_scaling.py::test_worker_restart[etcd-round_robin] SKIP (https://nvbugs/5776445)
|
||||
accuracy/test_llm_api_pytorch.py::TestGPTOSS::test_eagle3_vswa_reuse_4gpus[one_model] SKIP (https://nvbugs/5756028)
|
||||
accuracy/test_llm_api_pytorch.py::TestGPTOSS::test_eagle3_vswa_reuse_4gpus[two_model] SKIP (https://nvbugs/5756028)
|
||||
accuracy/test_llm_api_pytorch.py::TestQwen3_30B_A3B::test_fp8[latency-torch_compile=False] SKIP (https://nvbugs/5785206)
|
||||
|
||||
Loading…
Reference in New Issue
Block a user