Vision-R1_ Incentivizing Reasoning Capability in Multimodal Large Language Models
标题: Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models作者: Wenxuan Huang, Bohan Jia, Zijie Zhai, Shaosheng Cao, Zheyu Ye, Fei Zhao, Zhe Xu, Yao Hu, Shaohui Lin 等年份: 2026发表刊物: ICLR
研究主要背景
DeepSeek-R1-Zero 模型的…
2026/6/23 13:37:36