mirror of https://github.com/Fancy-MLLM/R1-Onevision.git, synced 2026-05-01 11:58:25 +08:00
update README.md
parent b05ab47fab
commit 3a00b7b75e
@@ -1,5 +1,5 @@
<div style="text-align: center;">
-<img src="asset/logo.gif" alt="LOGO">
+<img src="asset/logo.png" alt="LOGO">
</div>
@@ -14,7 +14,7 @@
**R1-OneVision** is a versatile **large multimodal reasoning model** designed to tackle complex visual reasoning tasks. It integrates visual and textual data to interpret multimodal information precisely, excelling in areas such as mathematics, science, deep image understanding, and logical reasoning. With its robust multimodal reasoning ability, **R1-OneVision is a powerful AI assistant capable of addressing a wide range of problem-solving challenges across different domains**.

## 🗺️ Roadmap for R1-Onevision
> R1-Onevision bridges the gap between the multimodal capabilities of Qwen-VL and the deep reasoning abilities of DeepSeek-R1, creating a state-of-the-art multimodal reasoning model that goes beyond the capabilities of GPT-4o.
@@ -41,3 +41,6 @@
### Performance
## 🏗️ Start
## 🧑‍💻 Authors
Authors: Yi Yang*, Xiaoxuan He*, Hongkun Pan*, Xiyan Jiang, Yan Deng, Xingtao Yang, Haoyu Lu, Minfeng Zhu†, Bo Zhang†, Wei Chen†
BIN asset/logo.gif
Binary file not shown.
Before: Size 3.4 MiB
BIN asset/logo.png (Normal file)
Binary file not shown.
After: Size 106 KiB