TensorRT-LLMs/tensorrt_llm/layers
2024-07-26 16:19:24 +08:00
File              Last commit                                                   Date
__init__.py       Update TensorRT-LLM (#1918)                                   2024-07-09 14:42:22 +08:00
activation.py     Update TensorRT-LLM (#787)                                    2024-01-02 17:54:32 +08:00
attention.py      open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033)  2024-07-26 16:19:24 +08:00
cast.py           Update TensorRT-LLM (#787)                                    2024-01-02 17:54:32 +08:00
conv.py           Update TensorRT-LLM (#787)                                    2024-01-02 17:54:32 +08:00
embedding.py      Update TensorRT-LLM (#1358)                                   2024-03-26 20:47:14 +08:00
linear.py         Update TensorRT-LLM (#1793)                                   2024-06-18 18:18:23 +08:00
lora.py           open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033)  2024-07-26 16:19:24 +08:00
mlp.py            open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033)  2024-07-26 16:19:24 +08:00
moe.py            open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033)  2024-07-26 16:19:24 +08:00
normalization.py  Update TensorRT-LLM (#1954)                                   2024-07-16 15:30:25 +08:00
pooling.py        Update TensorRT-LLM (#787)                                    2024-01-02 17:54:32 +08:00
recurrent.py      Update TensorRT-LLM (#1954)                                   2024-07-16 15:30:25 +08:00
ssm.py            Update TensorRT-LLM (#1954)                                   2024-07-16 15:30:25 +08:00