TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-19 01:05:12 +08:00

History

Bala Marimuthu 1c065fbb3e [#11109 ][feat] AutoDeploy: GLM 4.7 Flash Improvements (#11414 ) Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com> Signed-off-by: Balamurugan Marimuthu <246387390+bmarimuthu-nv@users.noreply.github.com> Signed-off-by: Gal Hubara Agam <96368689+galagam@users.noreply.github.com> Signed-off-by: greg-kwasniewski1 <213329731+greg-kwasniewski1@users.noreply.github.com> Signed-off-by: Gal Hubara-Agam <96368689+galagam@users.noreply.github.com> Co-authored-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com> Co-authored-by: Gal Hubara Agam <96368689+galagam@users.noreply.github.com> Co-authored-by: Grzegorz Kwasniewski <213329731+greg-kwasniewski1@users.noreply.github.com>		2026-02-17 08:43:59 -05:00
..
accuracy	[#11109 ][feat] AutoDeploy: GLM 4.7 Flash Improvements (#11414 )	2026-02-17 08:43:59 -05:00
cpp	[https://nvbugs/5630196 ] [fix] Prevent flaky failures in C++ test_e2e.py by using local cached datasets for benchmarking (#10638 )	2026-01-14 21:39:55 -05:00
deterministic	[https://nvbugs/4141427 ][chore] Add more details to LICENSE file (#9881 )	2025-12-13 08:35:31 +08:00
disaggregated	[https://nvbugs/5821433 ][fix] complete WAR for popen in QA env (#11214 )	2026-02-15 19:57:03 +08:00
examples	[#11109 ][feat] AutoDeploy: GLM 4.7 Flash Improvements (#11414 )	2026-02-17 08:43:59 -05:00
llmapi	[None][feat] Add priority-based KV cache offload filtering support (#10751 )	2026-02-05 05:22:56 -05:00
perf	[None][chore] Fix slurm job name (#11265 )	2026-02-15 19:57:03 +08:00
ray_orchestrator/RL	[TRTLLM-9737][chore] Add rl perf reproduce script and enhance the robustness of Ray tests (#9939 )	2025-12-24 15:27:01 +08:00
stress_test	[https://nvbugs/5823465 ][fix] Add CUTEDSL moe backend for deepseek r1 nvfp4 checkpoint in stress test (#10920 )	2026-02-15 19:57:03 +08:00
sysinfo	[None][infra] Enable single-gpu CI on spark (#9304 )	2025-12-30 17:22:14 +08:00
thirdparty	[TRTLLM-9228][infra] Verify thirdparty C++ process (#9367 )	2025-12-10 21:01:19 +08:00
triton_server	[None][feat] Expose enable_trt_overlap in Triton_backend brings 1.05x OTPS (#10018 )	2025-12-23 11:41:31 -06:00
utils	[TRTLLM-9711][infra] Fix the testcase name in timeout xml (#9781 )	2026-02-10 18:50:42 +08:00
__init__.py	[fix] Remove SpecConfig and fix thread leak issues (#5931 )	2025-07-12 21:03:24 +09:00
.test_durations	[TRTLLM-9766][feat] Integration of the KVCacheManager V2 to TRTLLM Runtime (#10659 )	2026-02-02 14:29:02 +08:00
agg_unit_mem_df.csv	[None][chore] Move test_trtllm_flashinfer_symbol_collision.py to tests/unittest/_torch (#11168 )	2026-02-09 13:57:57 +08:00
ci_profiler.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
common.py	[https://nvbugs/5760726 ][fix] Use random port in container port section (#10432 )	2026-01-06 23:25:46 +08:00
conftest.py	[TRTLLM-9581][infra] Use /home/scratch.trt_llm_data_ci in computelab (#10616 )	2026-01-19 00:40:40 -05:00
local_venv.py	[TRTLLM-5950][infra] Removing remaining turtle keywords from the code base (#7086 )	2025-09-07 14:26:18 +08:00
pytest.ini	[TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic (#9726 )	2025-12-16 05:16:32 -08:00
runner_interface.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_cases.yml	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_e2e.py	[https://nvbugs/5787904 ][fix] update mig tests (#11014 )	2026-02-15 19:57:03 +08:00
test_fmha.py	[TRTLLM-9805][feat] Skip Softmax Attention. (#9821 )	2025-12-21 02:52:42 -05:00
test_list_parser.py	[None][feat] add waive by sm version (#8928 )	2025-11-05 19:20:43 -08:00
test_list_validation.py	[Infra]Remove some old keyword (#4552 )	2025-05-31 13:50:45 +08:00
test_mlpf_results.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_sanity.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_unittests.py	[TRTLLM-9642][infra] Increase pytest verbosity for failed tests (#9657 )	2026-01-08 02:33:48 -05:00
trt_test_alternative.py	[TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520 )	2025-10-04 08:12:24 +08:00