TensorRT-LLMs/tensorrt_llm/layers
2025-02-25 21:21:49 +08:00
..
__init__.py Update TensorRT-LLM (#2460) 2024-11-19 18:30:34 +08:00
activation.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
attention.py Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00
cast.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
conv.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
embedding.py Update TensorRT-LLM (#2460) 2024-11-19 18:30:34 +08:00
linear.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
lora.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
mlp.py Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
moe.py Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00
normalization.py Update TensorRT-LLM (#2363) 2024-10-22 20:27:35 +08:00
pooling.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
recurrent.py Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
ssm.py Update TensorRT-LLM (#2562) 2024-12-11 00:31:05 -08:00