TensorRT-LLMs/tensorrt_llm/layers
2024-08-29 17:25:07 +08:00
..
__init__.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
activation.py Update TensorRT-LLM Release branch (#1192) 2024-02-29 17:20:55 +08:00
attention.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
cast.py Update TensorRT-LLM Release branch (#1192) 2024-02-29 17:20:55 +08:00
conv.py Update TensorRT-LLM Release branch (#1192) 2024-02-29 17:20:55 +08:00
embedding.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
linear.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
lora.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
mlp.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
moe.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
normalization.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
pooling.py Update TensorRT-LLM Release branch (#1192) 2024-02-29 17:20:55 +08:00
recurrent.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
ssm.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00