TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-02-06 03:01:50 +08:00

History

Venky b3146d095d [TRTC-122][feat] Eagle3 Specdec UX improvements (#10124 ) Signed-off-by: Venky Ganesh <23023424+venkywonka@users.noreply.github.com>		2026-01-22 07:24:11 -08:00
..
serve	[None][feat] Auto download speculative models from HF for pytorch backend, add speculative_model field alias (#10099 )	2026-01-14 21:06:07 -08:00
run_llm_fp8_quant_llama_70b.py
run_llm_quickstart_atexit.py
test_ad_guided_decoding.py	[#8245 ][feat] Autodeploy: Guided Decoding Support (#8551 )	2025-10-28 09:29:57 +08:00
test_ad_speculative_decoding.py	[TRTC-122][feat] Eagle3 Specdec UX improvements (#10124 )	2026-01-22 07:24:11 -08:00
test_bert.py
test_bindings.py
test_chatglm.py
test_commandr.py	[https://nvbugs/5410279 ][test] resubmit timeout refactor (#6337 )	2025-08-05 16:39:25 +08:00
test_draft_target_model.py
test_eagle.py
test_enc_dec.py
test_exaone.py	[https://nvbugs/5410279 ][test] resubmit timeout refactor (#6337 )	2025-08-05 16:39:25 +08:00
test_flux.py	[None][chore] Update the Flux autodeploy example (#8434 )	2025-11-18 14:16:04 -08:00
test_gemma.py	[TRTLLM-8638][fix] fix test issues (#8557 )	2025-10-24 02:16:55 -04:00
test_gpt.py	[https://nvbugs/5552132 ][fix] Enable LoRa for GPT OSS Torch (#8253 )	2025-12-03 15:42:15 +01:00
test_gptj.py
test_granite.py
test_internlm.py
test_llama.py	[None][chore] remove some model support; add device constraint (#10563 )	2026-01-09 09:36:23 -05:00
test_llm_api_with_mpi.py
test_mamba.py
test_medusa.py
test_mistral.py	[TRTLLM-6496][feat] Add LoRa Torch tests for the latest NIM model list (#6806 )	2025-10-03 12:10:48 -07:00
test_mixtral.py
test_multimodal.py
test_nemotron_nas.py	[TRTLLM-6496][feat] Add LoRa Torch tests for the latest NIM model list (#6806 )	2025-10-03 12:10:48 -07:00
test_nemotron.py
test_ngram.py
test_openai.py
test_phi.py	[TRTLLM-6496][feat] Add LoRa Torch tests for the latest NIM model list (#6806 )	2025-10-03 12:10:48 -07:00
test_qwen2audio.py
test_qwen.py
test_qwenvl.py
test_ray.py	[TRTLLM-9737][chore] Add rl perf reproduce script and enhance the robustness of Ray tests (#9939 )	2025-12-24 15:27:01 +08:00
test_recurrentgemma.py
test_redrafter.py
test_whisper.py	[https://nvbugs/5747930 ][fix] Use offline tokenizer for whisper models. (#10121 )	2025-12-20 09:42:07 +08:00