mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-14 06:27:45 +08:00
Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com> Signed-off-by: Larry <197874197+LarryXFly@users.noreply.github.com> Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| allowed_configs.py | ||
| build.py | ||
| data_export.py | ||
| data.py | ||
| gpu_clock_lock.py | ||
| misc.py | ||
| model_yaml_config.py | ||
| README.md | ||
| sanity_perf_check.py | ||
| session_data_writer.py | ||
| test_perf.py | ||
| utils.py | ||
Sanity Perf Check Introduction
Background
The sanity perf check mechanism is the way of perf regression detection for L0 testing. We create the base_perf.csv which consists of the several models' perf baseline and use the sanity_perf_check.py to detect the perf regression.
Usage
There're four typical scenarios for sanity perf check feature.
- The newly added MR doesn't impact the models' perf, the perf check will pass w/o exception.
- The newly added MR introduces the new model into perf model list. The sanity check will trigger the exception and the author of this MR needs to add the perf into base_perf.csv.
- The newly added MR improves the existed models' perf and the MR author need to refresh the base_perf.csv data w/ new baseline.
- The newly added MR introduces the perf regression and the MR author needs to fix the issue and rerun the pipeline.