| mamba | feat: Nemotron-H model support (#3430) | 2025-04-16 14:05:56 -07:00 |
| __init__.py | Update TensorRT-LLM (#2755) | 2025-02-11 03:01:00 +00:00 |
| attention.py | feat: Support unfused rope in MLA. (#3610) | 2025-04-17 16:50:49 +08:00 |
| fused_moe.py | Clean up linear.py, mlp.py, gated_mlp.py (#3553) | 2025-04-16 12:21:44 -07:00 |
| gated_mlp.py | Clean up linear.py, mlp.py, gated_mlp.py (#3553) | 2025-04-16 12:21:44 -07:00 |
| linear.py | Clean up linear.py, mlp.py, gated_mlp.py (#3553) | 2025-04-16 12:21:44 -07:00 |
| logits_procesor.py | Update (#2978) | 2025-03-23 16:39:35 +08:00 |
| mlp.py | Clean up linear.py, mlp.py, gated_mlp.py (#3553) | 2025-04-16 12:21:44 -07:00 |