TensorRT-LLMs/cpp/kernels/fmha_v2/test/conftest.py
qsang-nv 0fd59d64ab
infra: open source fmha v2 kernels (#4185)
* add fmha repo

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* fix format

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* fix code style

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* fix header

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* fix header kernel_traits.h

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* add .gitignore file

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* add SLIDING_WINDOW_ATTENTION

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* fix style

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* fix format

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* update setup.py

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

* update build_wheel.py

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>

---------

Signed-off-by: Qidi Sang <200703406+qsang-nv@users.noreply.github.com>
Signed-off-by: qsang-nv <200703406+qsang-nv@users.noreply.github.com>
2025-05-15 10:56:34 +08:00

50 lines
1.5 KiB
Python

# SPDX-FileCopyrightText: Copyright (c) 2023-2024 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
# SPDX-License-Identifier: NVIDIA TensorRT Source Code License Agreement
#
# NVIDIA CORPORATION, its affiliates and licensors retain all intellectual
# property and proprietary rights in and to this material, related
# documentation and any modifications thereto. Any use, reproduction,
# disclosure or distribution of this material and related documentation
# without an express license agreement from NVIDIA CORPORATION or
# its affiliates is strictly prohibited.
import subprocess
import pytest
def pytest_addoption(parser):
parser.addoption("--dry-run",
"--dryrun",
action="store_true",
help="dry run")
parser.addoption("--disable-rules",
"--disable_rules",
action="store_true",
help="disable filtering of test prompts")
@pytest.fixture
def dryrun(pytestconfig):
return pytestconfig.getoption("dry_run")
@pytest.fixture
def disable_rules(pytestconfig):
# - converted internally to _ in argparse
return pytestconfig.getoption("disable_rules")
@pytest.fixture
def gpu_compute_cap():
output = subprocess.check_output(
['nvidia-smi', "--query-gpu=compute_cap", "--format=csv"])
csv_header, csv_value, *other_csv_values = output.splitlines()
return int(float(csv_value) * 10)
# debug
@pytest.fixture
def mock_gpu_compute_cap():
return 30