🚀🚀 [Large models] Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Updated 2026-04-21 14:34:46 +08:00
Ollama Python library
Updated 2026-01-23 16:33:52 +08:00
A 500M real-time VLM that runs on CPU. Surpasses Moondream2 and SmolVLM. Train from scratch with ease.
Updated 2025-04-22 21:49:27 +08:00
R1-onevision, a visual language model capable of deep CoT reasoning.
Updated 2025-04-13 16:55:08 +08:00
arxiv-sanity lite: tag arXiv papers of interest and get recommendations of similar papers in a nice UI, using SVMs over TF-IDF feature vectors based on paper abstracts.
Updated 2025-03-03 12:49:31 +08:00