TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

xiweny c076a02b38 [TRTLLM-4629] [feat] Add support of CUDA13 and sm103 devices (#7568 ) Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com> Signed-off-by: Tian Zheng <29906817+Tom-Zheng@users.noreply.github.com> Signed-off-by: Daniel Stokes <dastokes@nvidia.com> Signed-off-by: Zhanrui Sun <zhanruis@nvidia.com> Signed-off-by: Xiwen Yu <xiweny@nvidia.com> Signed-off-by: Jiagan Cheng <jiaganc@nvidia.com> Signed-off-by: Yiqing Yan <yiqingy@nvidia.com> Signed-off-by: Bo Deng <deemod@nvidia.com> Signed-off-by: ZhanruiSunCh <184402041+ZhanruiSunCh@users.noreply.github.com> Signed-off-by: xiweny <13230610+VALLIS-NERIA@users.noreply.github.com> Co-authored-by: Tian Zheng <29906817+Tom-Zheng@users.noreply.github.com> Co-authored-by: Daniel Stokes <dastokes@nvidia.com> Co-authored-by: Zhanrui Sun <zhanruis@nvidia.com> Co-authored-by: Jiagan Cheng <jiaganc@nvidia.com> Co-authored-by: Yiqing Yan <yiqingy@nvidia.com> Co-authored-by: Bo Deng <deemod@nvidia.com> Co-authored-by: Zhanrui Sun <184402041+ZhanruiSunCh@users.noreply.github.com>		2025-09-16 09:56:18 +08:00
..
accuracy	[TRTLLM-4629] [feat] Add support of CUDA13 and sm103 devices (#7568 )	2025-09-16 09:56:18 +08:00
cpp	[None] [ci] Reorganize CMake and Python integration test infrastructure for C++ tests (#6754 )	2025-08-24 20:53:17 +02:00
deterministic	chore: Cleanup deprecated APIs from LLM-API (part 1/2) (#3732 )	2025-05-07 13:20:25 +08:00
disaggregated	[TRTLLM-4629] [feat] Add support of CUDA13 and sm103 devices (#7568 )	2025-09-16 09:56:18 +08:00
examples	[https://nvbugs/5492485 ][fix] Use offline dataset from llm-models instead. (#7435 )	2025-09-04 09:58:16 -07:00
llmapi	[None][doc] add legacy section for tensorrt engine (#6724 )	2025-09-01 11:02:31 +08:00
perf	[TRTLLM-7871][infra] Extend test_perf.py to add disagg-serving perf tests. (#7503 )	2025-09-10 17:35:51 +08:00
stress_test	[BREAKING CHANGE]: change default backend to PyTorch in trtllm-serve (#5717 )	2025-07-21 21:09:43 +08:00
sysinfo	[None][fix] Migrate to new cuda binding package name (#6700 )	2025-08-07 16:29:55 -04:00
triton_server	[TRTLLM-5950][infra] Removing remaining turtle keywords from the code base (#7086 )	2025-09-07 14:26:18 +08:00
utils	[https://nvbugs/5410279 ][test] resubmit timeout refactor (#6337 )	2025-08-05 16:39:25 +08:00
__init__.py	[fix] Remove SpecConfig and fix thread leak issues (#5931 )	2025-07-12 21:03:24 +09:00
.test_durations	[TRTLLM-7457][ci] Update unittest parallel config (#7297 )	2025-08-29 09:28:04 +08:00
agg_unit_mem_df.csv	[None][ci] set TORCHINDUCTOR_COMPILE_THREADS for thop/parallel tests (#7489 )	2025-09-04 06:04:51 -07:00
ci_profiler.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
common.py	[TRTLLM-7245][feat] add test_multi_nodes_eval tests (#7108 )	2025-08-22 17:17:27 +08:00
conftest.py	[TRTLLM-7876][test] Test trtllm-serve with --extra_llm_api_options (#7492 )	2025-09-04 10:34:38 +08:00
local_venv.py	[TRTLLM-5950][infra] Removing remaining turtle keywords from the code base (#7086 )	2025-09-07 14:26:18 +08:00
pytest.ini	test: fix PytestUnknownMarkWarning: Unknown pytest.mark.timeout (#6115 )	2025-07-17 20:55:04 +10:00
runner_interface.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_cache.py	chore: clean some ci of qa test (#3083 )	2025-03-31 14:30:41 +08:00
test_cases.yml	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_e2e.py	[TRTLLM-6903][feat] Support chunked prefill for multimodal models (#6843 )	2025-09-14 20:10:10 -07:00
test_fmha.py	[TRTLLM-5877][infra] Add fmha tests and auto trigger rules (#6050 )	2025-09-09 11:33:09 +08:00
test_list_parser.py	[TRTLLM-4535][infra]: Add marker TIMEOUT for test level (#3905 )	2025-05-25 23:30:40 -07:00
test_list_validation.py	[Infra]Remove some old keyword (#4552 )	2025-05-31 13:50:45 +08:00
test_mlpf_results.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_sanity.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_unittests.py	[TRTLLM-4629] [feat] Add support of CUDA13 and sm103 devices (#7568 )	2025-09-16 09:56:18 +08:00
trt_test_alternative.py	[TRTLLM-7457][ci] Update & cleanup unittest parallel config (#7254 )	2025-08-27 00:45:58 -04:00