Paper | ICLR 2026 Poster | 2026 | trustworthy medical AI

MedVR: Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning

Accepted as a poster at ICLR 2026. Medical Vision-Language Models (VLMs) hold immense promise for complex clinical tasks, but their reasoning capabilities are often constrained by text-only paradigms that fail to ground inferences in visual evidence. This limitation not only curtails performance on tasks requiring fine-grained visual analysis but also introduces risks of visual hallucination in safety-critical applications. To address this, we introduce MedVR, a novel reinforcement learning framework that enables annotation-free visual reasoning for medical VLMs. Its core innovation lies in two synergistic mechanisms: Entropy-guided Visual Regrounding (EVR) uses model uncertainty to direct exploration, while Consensus-based Credit Assignment (CCA) distills pseudo-supervision from rollout agreement.
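The abstract names EVR and CCA without giving their details, so the following is only a minimal sketch of the two general ideas, not the authors' implementation. It assumes that "model uncertainty" is measured as the Shannon entropy of each next-token distribution and that "rollout agreement" means a majority vote over the final answers of several rollouts. The function names, the threshold, and the 0/1 reward are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def token_entropy(probs):
    """Shannon entropy (in nats) of one next-token probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def regrounding_steps(step_probs, threshold):
    """Return indices of decoding steps whose entropy exceeds `threshold`.

    Assumption: a high-entropy step is where the model is uncertain, so it is
    a candidate point for looking at the image again (hypothetical trigger).
    """
    return [i for i, probs in enumerate(step_probs)
            if token_entropy(probs) > threshold]


def consensus_rewards(answers):
    """Majority-vote pseudo-label plus a 0/1 reward for each rollout.

    Assumption: with no annotations, the most common final answer among the
    rollouts stands in for the label. `answers` must be non-empty.
    """
    counts = Counter(answers)
    pseudo_label, votes = counts.most_common(1)[0]
    agreement = votes / len(answers)
    rewards = [1.0 if a == pseudo_label else 0.0 for a in answers]
    return pseudo_label, agreement, rewards
```

For example, with per-step distributions `[[0.97, 0.01, 0.01, 0.01], [0.25, 0.25, 0.25, 0.25]]` and a threshold of 1.0 nat, only the second (uniform) step is flagged. With rollout answers `["B", "A", "B", "C"]`, the pseudo-label is `"B"` with an agreement of 0.5. A real system would need a policy for ties and for low-agreement groups, which this sketch does not address.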

Medical imaging computing, Medical multimodal, Clinical language intelligence, Paper, Think with images, Medical visual reasoning, Medical VQA, Agentic reinforcement learning, ICLR 2026, ICLR 2026 Poster, medical_llm_agent

Paper details

English title: MedVR: Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning
Authors: Zheng Jiang, Heng Guo, Chengyu Fang, Changchen Xiao, Xinyang Hu, Lifeng Sun, Minfeng Xu
Journal/Conference: ICLR 2026 Poster
Publication year: 2026
Research area: trustworthy medical AI