diff --git a/images/LLM-structure-moe.jpg b/images/LLM-structure-moe.jpg
new file mode 100644
index 0000000..ac45ff1
Binary files /dev/null and b/images/LLM-structure-moe.jpg differ
diff --git a/images/LLM-structure-moe.png b/images/LLM-structure-moe.png
deleted file mode 100644
index 4588477..0000000
Binary files a/images/LLM-structure-moe.png and /dev/null differ
diff --git a/images/LLM-structure.jpg b/images/LLM-structure.jpg
new file mode 100644
index 0000000..2fe5a6b
Binary files /dev/null and b/images/LLM-structure.jpg differ
diff --git a/images/LLM-structure.png b/images/LLM-structure.png
deleted file mode 100755
index bbd93dd..0000000
Binary files a/images/LLM-structure.png and /dev/null differ
diff --git a/images/minimind-3.gif b/images/minimind-3.gif
new file mode 100644
index 0000000..131d762
Binary files /dev/null and b/images/minimind-3.gif differ
diff --git a/images/minimind2.gif b/images/minimind2.gif
deleted file mode 100644
index 43c9cd1..0000000
Binary files a/images/minimind2.gif and /dev/null differ
diff --git a/index.html b/index.html
index 2b43dbf..d415c5d 100644
--- a/index.html
+++ b/index.html
@@ -528,7 +528,7 @@
- Train a 26M ChatBot from zero.
+ Train a 64M ChatBot from zero.
2 hours. ¥3. One 3090.
That's it.
Parameters
Cost
vs GPT-3
-Complete pipeline: Tokenizer → Pretrain → SFT → LoRA → PPO/GRPO/SPO
+Complete pipeline: Tokenizer → Pretrain → SFT → LoRA → DPO → PPO/GRPO/CISPO → Agentic RL
-PPO, GRPO, SPO + YaRN length extrapolation. Native PyTorch implementation.
+PPO, GRPO, CISPO + Agentic RL + YaRN length extrapolation. Native PyTorch.
-Compatible with vLLM, ollama, llama.cpp, transformers.
+Compatible with vLLM, ollama, llama.cpp, SGLang, transformers.
-Drop-in replacement for FastGPT, Open-WebUI, Dify.
+Drop-in replacement for FastGPT, Open-WebUI, Dify. Tool Calling & Adaptive Thinking.
-
-
+
+
+
💭 "Building a Lego plane beats flying first class."