[feat] update html

jingyaogong 2025-10-26 19:00:24 +08:00
parent c050af959a
commit 1eccfec438

@@ -632,11 +632,14 @@
<div class="changelog-content">
<ul>
<li>🔥 RLAIF algorithms: PPO, GRPO, SPO (native PyTorch)</li>
<li>Checkpoint resume training: auto-save & cross-GPU recovery</li>
<li>RLAIF dataset: rlaif-mini.jsonl (10K samples); Simplified DPO dataset with Chinese data</li>
<li>YaRN algorithm for RoPE length extrapolation</li>
<li>Adaptive Thinking in reasoning models</li>
<li>Tool Calling & Reasoning tags support</li>
<li>Complete RLAIF chapter with training curves</li>
<li>SwanLab integration (WandB alternative for China)</li>
<li>Bug fixes and performance improvements</li>
<li>Code standardization & bug fixes</li>
</ul>
</div>
</div>