forforever73
b83111815e
model : support Step3.5-Flash (#19283)
* Support Step3.5-Flash
* fix: norm.weight + 1 (HF zero_centered=true)
* step35: simplify GGUF conversion + drop redundant rope KVs
* Address review feedback
* rename limits -> clamp
* Apply suggestions from code review
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* Apply suggestion from @CISC
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
* rename swiglu limits -> swiglu clamp in LLM_KV
* avoid CI fail
* Apply suggestions from code review
* Apply suggestions from code review
* disabled KV shifting for LLM_ARCH_STEP35
* Apply suggestions from code review
* mistakenly removed cmath
* add model size && apply missed suggestion
* assert partial_rotary_factors
* fix CI errors:
* load freq_base_swa
---------
Co-authored-by: lvyichen <lvyichen@stepfun.com>
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2026-02-06 21:06:14 +01:00
..
2026-01-05 09:14:04 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-01-01 18:38:51 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-01-02 19:01:56 +02:00
2026-01-05 09:14:04 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-02-02 08:38:55 +02:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-11-10 22:55:30 +01:00
2025-10-31 23:40:23 +01:00
2026-01-13 23:28:38 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2026-01-05 09:14:04 +01:00
2026-01-02 19:01:56 +02:00
2026-01-23 18:22:34 +02:00
2026-01-02 19:01:56 +02:00
2025-10-31 23:40:23 +01:00
2025-12-16 11:25:26 +01:00
2025-12-16 11:25:26 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-02-06 11:39:58 +01:00
2025-11-27 16:04:29 +02:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-01-05 09:14:04 +01:00
2025-12-24 14:02:36 +01:00
2026-01-02 20:11:59 +01:00
2025-10-31 23:40:23 +01:00
2025-12-24 23:07:08 +01:00
2026-01-22 22:09:01 +02:00
2025-10-31 23:40:23 +01:00
2025-12-01 12:26:52 +01:00
2026-02-06 21:06:14 +01:00
2026-01-05 09:14:04 +01:00
2025-10-31 23:40:23 +01:00
2026-01-21 18:31:34 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-01-05 09:14:04 +01:00
2026-02-03 14:20:57 +01:00
2025-11-04 12:29:15 +01:00
2025-11-05 10:28:58 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-10-31 23:40:23 +01:00
2025-12-28 17:28:31 +01:00
2025-11-04 12:29:15 +01:00
2026-01-22 22:09:01 +02:00
2025-12-15 18:51:43 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-02-04 17:55:31 +01:00
2026-01-23 18:22:34 +02:00
2026-01-23 18:22:34 +02:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-11-24 14:16:56 +08:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2026-01-05 09:14:04 +01:00
2025-11-04 12:29:15 +01:00
2025-10-31 23:40:23 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2026-02-06 21:06:14 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00
2025-11-04 12:29:15 +01:00