| Author | Commit | Message | Date |
|---|---|---|---|
| Frida Hou | 7910d4d2a9 | [#8242][feat] Add int4 GPTQ support for AutoDeploy (#8248) Signed-off-by: Fridah-nv <201670829+Fridah-nv@users.noreply.github.com> | 2026-01-30 23:07:24 -08:00 |
| Lucas Liebenwein | ff3a494f5c | [#10013][feat] AutoDeploy: native cache manager integration (#10635) Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com> | 2026-01-27 11:23:22 -05:00 |
| tcherckez-nvidia | 43b8a5561c | [None][chore] update AD model list (#10981) Signed-off-by: Tal Cherckez <127761168+tcherckez-nvidia@users.noreply.github.com> | 2026-01-26 16:49:50 +02:00 |
| tcherckez-nvidia | f6c4dd885f | [None][chore] Update AutoDeploy model list (#10505) Signed-off-by: Tal Cherckez <127761168+tcherckez-nvidia@users.noreply.github.com> | 2026-01-10 08:47:37 +02:00 |
| tcherckez-nvidia | 56ef97e06e | [#10246][feature] Move AD dashboard to use cudagraph compile backend (#10267) Signed-off-by: Tal Cherckez <127761168+tcherckez-nvidia@users.noreply.github.com> | 2025-12-24 11:09:59 +02:00 |
| tcherckez-nvidia | 64bb1a5155 | [None][chore] Update AD coverage to use torch-cudagraph (#10233) Signed-off-by: Tal Cherckez <127761168+tcherckez-nvidia@users.noreply.github.com> | 2025-12-23 07:20:32 -05:00 |
| tcherckez-nvidia | 9f6abaf59f | [#9640][feat] Migrate model registry to v2.0 format with composable configs (#9836) Signed-off-by: Tal Cherckez <127761168+tcherckez-nvidia@users.noreply.github.com> | 2025-12-19 05:30:02 -08:00 |