mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-14 06:27:45 +08:00
Claim support for QwQ 32B (#2877)
Signed-off-by: Laikh Tewari <laikhtewari1@gmail.com>
This commit is contained in:
parent
37644e22bc
commit
456a850e66
@ -56,6 +56,7 @@ In addition, there are two shared files in the parent folder [`examples`](../) f
|
||||
| Qwen2.5-7B(-Instruct)| Y | Y | Y | Y | Y | Y | Y | Y | Ampere+ |
|
||||
| Qwen2.5-32B(-Instruct)| Y | Y | Y | Y | Y | Y | Y | Y | Ampere+ |
|
||||
| Qwen2.5-72B(-Instruct)| Y | Y | Y | Y* | Y | Y | Y | Y | Ampere+ |
|
||||
| QwQ-32B | Y | Y | Y | Y | Y | Y | Y | Y | Ampere+ |
|
||||
|
||||
Please note that Y* sign means that the model does not support all the AWQ + TP combination.
|
||||
|
||||
|
||||
Loading…
Reference in New Issue
Block a user