mirror of
https://github.com/jingyaogong/minimind.git
synced 2026-01-13 19:57:20 +08:00
[feat] update html
This commit is contained in:
parent
c050af959a
commit
1eccfec438
@ -632,11 +632,14 @@
|
||||
<div class="changelog-content">
|
||||
<ul>
|
||||
<li>🔥 RLAIF algorithms: PPO, GRPO, SPO (native PyTorch)</li>
|
||||
<li>Checkpoint resume training: auto-save & cross-GPU recovery</li>
|
||||
<li>RLAIF dataset: rlaif-mini.jsonl (10K samples); Simplified DPO dataset with Chinese data</li>
|
||||
<li>YaRN algorithm for RoPE length extrapolation</li>
|
||||
<li>Adaptive Thinking in reasoning models</li>
|
||||
<li>Tool Calling & Reasoning tags support</li>
|
||||
<li>Complete RLAIF chapter with training curves</li>
|
||||
<li>SwanLab integration (WandB alternative for China)</li>
|
||||
<li>Bug fixes and performance improvements</li>
|
||||
<li>Code standardization & bug fixes</li>
|
||||
</ul>
|
||||
</div>
|
||||
</div>
|
||||
|
||||
Loading…
Reference in New Issue
Block a user