[https://nvbugs/5625962][chore] unwaive DS-v32-fp4 tests (#8853)

Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
Fanrong Li 2025-11-03 16:34:52 +08:00 committed by GitHub
parent 00c0e6c440
commit e9f78c687a
GPG Key ID: B5690EEEBB952194
2 changed files with 5 additions and 2 deletions

@@ -198,6 +198,11 @@ deepseek-ai/DeepSeek-V3.2-Exp:
   - quant_algo: FP8_BLOCK_SCALES
     spec_dec_algo: MTP
     accuracy: 88.2
+  - quant_algo: NVFP4
+    accuracy: 87.81
+  - quant_algo: NVFP4
+    spec_dec_algo: MTP
+    accuracy: 87.81
 Qwen3/Qwen3-8B:
   - quant_algo: W4A8_MXFP4_FP8
     accuracy: 72.70

@@ -394,5 +394,3 @@ accuracy/test_disaggregated_serving.py::TestQwen3_30B_A3B::test_mixed_ctx_gen_mo
 accuracy/test_disaggregated_serving.py::TestGPTOSS::test_auto_dtype[True] SKIP (https://nvbugs/5624367)
 accuracy/test_disaggregated_serving.py::TestGPTOSS::test_auto_dtype[False] SKIP (https://nvbugs/5624367)
 disaggregated/test_disaggregated.py::test_disaggregated_benchmark_on_diff_backends[llama-v3-8b-hf] SKIP (https://nvbugs/5587574)
-accuracy/test_llm_api_pytorch.py::TestDeepSeekV32::test_nvfp4_multi_gpus[baseline] SKIP (https://nvbugs/5625962)
-accuracy/test_llm_api_pytorch.py::TestDeepSeekV32::test_nvfp4_multi_gpus[baseline_mtp1] SKIP (https://nvbugs/5625962)