| .. |
|
accuracy
|
[feat] Add TRTLLM MoE nvfp4 cubins for mid-high concurrency; attention_dp for TRTLLM MoE (#5723)
|
2025-07-10 14:06:50 +08:00 |
|
cpp
|
Fix GEMM+AR fusion on blackwell (#5563)
|
2025-07-09 08:48:47 +08:00 |
|
deterministic
|
chore: Cleanup deprecated APIs from LLM-API (part 1/2) (#3732)
|
2025-05-07 13:20:25 +08:00 |
|
disaggregated
|
[TRTLLM-5530] chore: rename LLM.autotuner_enabled to enable_autotuner (#5876)
|
2025-07-10 11:31:35 +08:00 |
|
examples
|
Fix GEMM+AR fusion on blackwell (#5563)
|
2025-07-09 08:48:47 +08:00 |
|
llmapi
|
chore [TRTLLM-6161]: add LLM speculative decoding example (#5706)
|
2025-07-09 07:33:11 +08:00 |
|
perf
|
test: Validate and add accuracy& perf tests for Ministral-8B-Instruct[-FP8](pytorch only) (#5654)
|
2025-07-08 18:16:21 -07:00 |
|
stress_test
|
Revert "chore: [Breaking Change] Rename cuda_graph_config padding_enabled fie…" (#5818)
|
2025-07-08 13:15:30 +09:00 |
|
sysinfo
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
triton_server
|
[nvbugs/5309940] Add support for input output token counts (#5445)
|
2025-06-28 04:39:39 +08:00 |
|
__init__.py
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
.test_durations
|
[NVBUG-5304516/5319741]Qwen2.5VL FP8 support (#5029)
|
2025-07-09 23:16:42 +08:00 |
|
agg_unit_mem_df.csv
|
test: reorganize tests folder hierarchy (#2996)
|
2025-03-27 12:07:53 +08:00 |
|
ci_profiler.py
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
common.py
|
ReDrafter support for Qwen (#4875)
|
2025-06-28 02:33:10 +08:00 |
|
conftest.py
|
[TRTLLM-5366][feat]Add support for sm121 (#5524)
|
2025-07-08 14:27:00 -07:00 |
|
local_venv.py
|
tests: https://nvbugs/5219534 remove failed tests from test list (#4113)
|
2025-05-12 14:13:40 +08:00 |
|
pytest.ini
|
test: Add fixture to skip tests based on MPI world size (#5028)
|
2025-06-16 11:25:01 +08:00 |
|
runner_interface.py
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
test_cache.py
|
chore: clean some ci of qa test (#3083)
|
2025-03-31 14:30:41 +08:00 |
|
test_cases.yml
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
test_e2e.py
|
feat(models): Mistral3.1 VLM pytorch backend support (#5529)
|
2025-07-09 13:17:40 -07:00 |
|
test_list_parser.py
|
[TRTLLM-4535][infra]: Add marker TIMEOUT for test level (#3905)
|
2025-05-25 23:30:40 -07:00 |
|
test_list_validation.py
|
[Infra]Remove some old keyword (#4552)
|
2025-05-31 13:50:45 +08:00 |
|
test_mlpf_results.py
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
test_rerun.py
|
[Infra][TRTLLM-3929] Rerun failure tests (#3264)
|
2025-05-27 16:13:23 +08:00 |
|
test_sanity.py
|
Update (#2978)
|
2025-03-23 16:39:35 +08:00 |
|
test_unittests.py
|
[fix][ci] correct unittests test prefix (#5547)
|
2025-06-27 20:34:44 +08:00 |
|
trt_test_alternative.py
|
chore: improve disagg test failure detection (#4738)
|
2025-06-15 01:28:26 +08:00 |