Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Updated 2026-01-13 13:32:18 +08:00
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Updated 2026-01-13 13:20:15 +08:00
A modular graph-based Retrieval-Augmented Generation (RAG) system
Updated 2026-01-13 10:00:21 +08:00
Ollama Python library
Updated 2026-01-10 15:21:07 +08:00
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Updated 2026-01-07 23:33:47 +08:00
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.
Updated 2025-04-22 21:49:27 +08:00
R1-onevision, a visual language model capable of deep CoT reasoning.
Updated 2025-04-13 16:55:08 +08:00
arxiv-sanity lite: tag arxiv papers of interest get recommendations of similar papers in a nice UI using SVMs over tfidf feature vectors based on paper abstracts.
Updated 2025-03-03 12:49:31 +08:00