TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-26 13:43:38 +08:00

Anish Shanbhag a09b38a862 [TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330) Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>	2025-10-28 09:17:26 -07:00
baichuan	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
bert
bloom
chatglm	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
clip
cogvlm
commandr
dbrx
deepseek_v1	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
deepseek_v2	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
dit	Support RingAttention in the BertAttention plugin and the DiT model (#3661)	2025-05-09 08:06:54 +08:00
eagle	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
enc_dec	[None][fix] Refactoring to avoid circular import when importing torch models (#6720)	2025-08-11 18:00:42 -04:00
falcon	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
gemma	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
gpt	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
gptj	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
gptneox
grok	[None][fix] Refactoring to avoid circular import when importing torch models (#6720)	2025-08-11 18:00:42 -04:00
llama	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
mamba	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
medusa	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
mllama	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
mmdit_sd3	[None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851)	2025-09-25 21:02:35 +08:00
mpt
multimodal_encoders
nemotron_nas	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
opt
phi	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
phi3	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
qwen	[None][chore] update torch_dtype -> dtype in 'transformers' (#8263)	2025-10-15 17:09:30 +09:00
recurrentgemma
redrafter	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
stdit	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
unet	[None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851)	2025-09-25 21:02:35 +08:00
__init__.py	[None][feat] Add Qwen3 MoE support to TensorRT backend (#6470)	2025-08-06 17:02:35 +08:00
automodel.py	[nvbug/5387226] chore: add propogation for trust_remote_code to AutoConfig (#6001)	2025-07-16 16:05:38 +08:00
convert_utils.py
generation_mixin.py	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00
model_weights_loader.py	[None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851)	2025-09-25 21:02:35 +08:00
modeling_utils.py	[TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330)	2025-10-28 09:17:26 -07:00