mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-02-19 01:05:12 +08:00
[https://nvbugs/5625962][chore] unwaive DS-v32-fp4 tests (#8853)
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
This commit is contained in:
parent
00c0e6c440
commit
e9f78c687a
@ -198,6 +198,11 @@ deepseek-ai/DeepSeek-V3.2-Exp:
|
||||
- quant_algo: FP8_BLOCK_SCALES
|
||||
spec_dec_algo: MTP
|
||||
accuracy: 88.2
|
||||
- quant_algo: NVFP4
|
||||
accuracy: 87.81
|
||||
- quant_algo: NVFP4
|
||||
spec_dec_algo: MTP
|
||||
accuracy: 87.81
|
||||
Qwen3/Qwen3-8B:
|
||||
- quant_algo: W4A8_MXFP4_FP8
|
||||
accuracy: 72.70
|
||||
|
||||
@ -394,5 +394,3 @@ accuracy/test_disaggregated_serving.py::TestQwen3_30B_A3B::test_mixed_ctx_gen_mo
|
||||
accuracy/test_disaggregated_serving.py::TestGPTOSS::test_auto_dtype[True] SKIP (https://nvbugs/5624367)
|
||||
accuracy/test_disaggregated_serving.py::TestGPTOSS::test_auto_dtype[False] SKIP (https://nvbugs/5624367)
|
||||
disaggregated/test_disaggregated.py::test_disaggregated_benchmark_on_diff_backends[llama-v3-8b-hf] SKIP (https://nvbugs/5587574)
|
||||
accuracy/test_llm_api_pytorch.py::TestDeepSeekV32::test_nvfp4_multi_gpus[baseline] SKIP (https://nvbugs/5625962)
|
||||
accuracy/test_llm_api_pytorch.py::TestDeepSeekV32::test_nvfp4_multi_gpus[baseline_mtp1] SKIP (https://nvbugs/5625962)
|
||||
|
||||
Loading…
Reference in New Issue
Block a user