论文ICLR 2026 Poster2026 年clinical NLP

VLM-SubtleBench：VLM 距离人类级细微比较推理还有多远？

ICLR 2026 Poster accepted paper at ICLR 2026. The ability to distinguish subtle differences between visually similar images is essential for diverse domains such as industrial anomaly detection, medical imaging, and aerial surveillance. While comparative reasoning benchmarks for vision-language models (VLMs) have recently emerged, they primarily focus on images with large, salient differences and fail to capture the nuanced reasoning required for real-world applications. In this work, we introduce **VLM-SubtleBench**, a benchmark designed to evaluate VLMs on *subtle comparative reasoning*. Our benchmark covers ten difference types—Attribute, State, Emotion, Temporal, Spatial, Existence, Quantity, Quality, Viewpoint, and Action—and curate paired question–image sets reflecting these fine-grained variations.

医学影像计算医疗多模态临床语言智能论文 Vision-language Models Multimodal Large Language Models Comparative Reasoning Benchmark Visual Question Answering ICLR 2026 ICLR 2026 Poster remaining batch

论文详情

英文标题: VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning?
作者: Minkyu Kim, Sangheon Lee, Dongmin Park
期刊/会议: ICLR 2026 Poster
发表年份: 2026 年
研究方向: clinical NLP