This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-04 18:21:52 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
5f9fc50233
TensorRT-LLMs
/
tests
/
integration
/
defs
/
examples
History
Jie Li
627d306df9
[None][chore] remove some model support; add device constraint (
#10563
)
...
Signed-off-by: Jie Li <lijie@nvidia.com>
2026-01-09 09:36:23 -05:00
..
serve
[None][fix] Fix iteration stats for spec-dec (
#9855
)
2025-12-16 14:11:38 -08:00
run_llm_fp8_quant_llama_70b.py
run_llm_quickstart_atexit.py
test_ad_guided_decoding.py
[
#8245
][feat] Autodeploy: Guided Decoding Support (
#8551
)
2025-10-28 09:29:57 +08:00
test_ad_speculative_decoding.py
[
#9241
][feat] AutoDeploy: Support Eagle3 Speculative Decoding (
#9869
)
2025-12-24 23:30:42 -05:00
test_bert.py
test_bindings.py
test_chatglm.py
test_commandr.py
test_draft_target_model.py
test_eagle.py
test_enc_dec.py
test_exaone.py
test_flux.py
[None][chore] Update the Flux autodeploy example (
#8434
)
2025-11-18 14:16:04 -08:00
test_gemma.py
[TRTLLM-8638][fix] fix test issues (
#8557
)
2025-10-24 02:16:55 -04:00
test_gpt.py
[
https://nvbugs/5552132
][fix] Enable LoRa for GPT OSS Torch (
#8253
)
2025-12-03 15:42:15 +01:00
test_gptj.py
test_granite.py
test_internlm.py
test_llama.py
[None][chore] remove some model support; add device constraint (
#10563
)
2026-01-09 09:36:23 -05:00
test_llm_api_with_mpi.py
test_mamba.py
test_medusa.py
test_mistral.py
[TRTLLM-6496][feat] Add LoRa Torch tests for the latest NIM model list (
#6806
)
2025-10-03 12:10:48 -07:00
test_mixtral.py
test_multimodal.py
test_nemotron_nas.py
[TRTLLM-6496][feat] Add LoRa Torch tests for the latest NIM model list (
#6806
)
2025-10-03 12:10:48 -07:00
test_nemotron.py
test_ngram.py
test_openai.py
test_phi.py
[TRTLLM-6496][feat] Add LoRa Torch tests for the latest NIM model list (
#6806
)
2025-10-03 12:10:48 -07:00
test_qwen2audio.py
test_qwen.py
test_qwenvl.py
test_ray.py
[TRTLLM-9737][chore] Add rl perf reproduce script and enhance the robustness of Ray tests (
#9939
)
2025-12-24 15:27:01 +08:00
test_recurrentgemma.py
test_redrafter.py
test_whisper.py
[
https://nvbugs/5747930
][fix] Use offline tokenizer for whisper models. (
#10121
)
2025-12-20 09:42:07 +08:00