论文ICLR 2026 Poster2026 年clinical prediction

M3CoTBench：医学图像理解中 MLLM 思维链基准

ICLR 2026 Poster accepted paper at ICLR 2026. Chain-of-Thought (CoT) reasoning has proven effective in enhancing large language models by encouraging step-by-step intermediate reasoning, and recent advances have extended this paradigm to Multimodal Large Language Models (MLLMs). In the medical domain, where diagnostic decisions depend on nuanced visual cues and sequential reasoning, CoT aligns naturally with clinical thinking processes. However, current benchmarks for medical image understanding generally focus on the final answer while ignoring the reasoning path. An opaque process lacks reliable bases for judgment, making it difficult to assist doctors in diagnosis.

医学影像计算医疗多模态临床语言智能论文 Chain-of-Thought Multimodal Large Language Models M3CoTBench Benchmark ICLR 2026 ICLR 2026 Poster

论文详情

英文标题: M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding
作者: Juntao Jiang, Jiangning Zhang, Yali bi, BAI Jinsheng, Weixuan Liu, Weiwei Jin, Zhucun Xue, Yong Liu, Xiaobin Hu, Shuicheng YAN
期刊/会议: ICLR 2026 Poster
发表年份: 2026 年
研究方向: clinical prediction