TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Jie Li 627d306df9 [None][chore] remove some model support; add device constraint (#10563 ) Signed-off-by: Jie Li <lijie@nvidia.com>		2026-01-09 09:36:23 -05:00
..
serve
run_llm_fp8_quant_llama_70b.py
run_llm_quickstart_atexit.py
test_ad_guided_decoding.py
test_ad_speculative_decoding.py
test_bert.py
test_bindings.py
test_chatglm.py
test_commandr.py
test_draft_target_model.py
test_eagle.py
test_enc_dec.py
test_exaone.py
test_flux.py
test_gemma.py
test_gpt.py
test_gptj.py
test_granite.py
test_internlm.py
test_llama.py	[None][chore] remove some model support; add device constraint (#10563 )	2026-01-09 09:36:23 -05:00
test_llm_api_with_mpi.py
test_mamba.py
test_medusa.py
test_mistral.py
test_mixtral.py
test_multimodal.py
test_nemotron_nas.py
test_nemotron.py
test_ngram.py
test_openai.py
test_phi.py
test_qwen2audio.py
test_qwen.py
test_qwenvl.py
test_ray.py
test_recurrentgemma.py
test_redrafter.py
test_whisper.py