[CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"
Updated Feb 13, 2026 - Python
[CVPR 2026] AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
A think-with-image GUI visual grounding model.