mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-13 22:18:36 +08:00
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
This commit is contained in:
parent
a65b0d4efa
commit
e2f5455533
@ -43,3 +43,4 @@ l0_perf:
|
||||
backend: pytorch
|
||||
tests:
|
||||
- perf/test_perf.py::test_perf[llama_v3.1_8b_instruct-bench-_autodeploy-float16-input_output_len:128,128-reqs:8192]
|
||||
- perf/test_perf.py::test_perf[deepseek_r1_distill_qwen_32b-bench-_autodeploy-float16-input_output_len:1024,1024-reqs:512]
|
||||
|
||||
Loading…
Reference in New Issue
Block a user