TensorRT-LLMs/tensorrt_llm/layers
Kaiyu Xie e153372759
Update TensorRT-LLM (#2253)
* Update TensorRT-LLM

---------

Co-authored-by: Ivan Sorokin <isorokin@nvidia.com>
Co-authored-by: lkm2835 <lkm2835@gmail.com>
2024-09-24 17:27:31 +02:00
__init__.py Update TensorRT-LLM (#2253) 2024-09-24 17:27:31 +02:00
activation.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
attention.py Update TensorRT-LLM (#2215) 2024-09-10 18:21:22 +08:00
cast.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
conv.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
embedding.py Update TensorRT-LLM (#2253) 2024-09-24 17:27:31 +02:00
linear.py Update TensorRT-LLM (#2230) 2024-09-17 14:39:09 +08:00
lora.py Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
mlp.py Update TensorRT-LLM (#2230) 2024-09-17 14:39:09 +08:00
moe.py Update TensorRT-LLM (#2253) 2024-09-24 17:27:31 +02:00
normalization.py Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
pooling.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
recurrent.py Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
ssm.py Update TensorRT-LLM 2024-08-20 18:55:15 +08:00