Name                 Last commit                                                           Date
__init__.py          Update TensorRT-LLM (#2873)                                           2025-03-11 21:13:42 +08:00
activation.py        Update TensorRT-LLM (#787)                                            2024-01-02 17:54:32 +08:00
attention.py         refactor: use x is None instead of x == None. (#4244)                 2025-05-15 20:00:04 +08:00
cast.py              Update TensorRT-LLM (#787)                                            2024-01-02 17:54:32 +08:00
conv.py              Update TensorRT-LLM (#2873)                                           2025-03-11 21:13:42 +08:00
embedding.py         fix: #3137 speculative decoding and multimodal input support (#3276)  2025-04-09 23:40:19 +08:00
language_adapter.py  Update TensorRT-LLM (#2849)                                           2025-03-04 18:44:00 +08:00
linear.py            Update TensorRT-LLM (#2873)                                           2025-03-11 21:13:42 +08:00
lora.py              Update TensorRT-LLM (#2755)                                           2025-02-11 03:01:00 +00:00
mlp.py               Update TensorRT-LLM (#2936)                                           2025-03-18 21:25:19 +08:00
moe.py               chore: Mass integration of release/0.20. (#4871)                      2025-06-04 14:12:27 +08:00
normalization.py     Update TensorRT-LLM (#2849)                                           2025-03-04 18:44:00 +08:00
pooling.py           Update TensorRT-LLM (#787)                                            2024-01-02 17:54:32 +08:00
recurrent.py         Update TensorRT-LLM (#1954)                                           2024-07-16 15:30:25 +08:00
ssm.py               Update TensorRT-LLM (#2562)                                           2024-12-11 00:31:05 -08:00