论文ICLR 2026 Poster2026 年trustworthy medical AI Johnson-Lindenstrauss 引理引导的高效 3D 医学分割网络
ICLR 2026 Poster accepted paper at ICLR 2026. Lightweight 3D medical image segmentation remains constrained by a fundamental "efficiency / robustness conflict", particularly when processing complex anatomical structures and heterogeneous modalities. In this paper, we study how to redesign the framework based on the characteristics of high-dimensional 3D images, and explore data synergy to overcome the fragile representation of lightweight methods. Our approach, VeloxSeg, begins with a deployable and extensible dual-stream CNN-Transformer architecture composed of Paired Window Attention (PWA) and Johnson-Lindenstrauss lemma-guided convolution (JLC). For each 3D image, we invoke a "glance-and-focus" principle, where PWA rapidly retrieves multi-scale information, and JLC ensures robust local feature extraction with minimal parameters, significantly enhancing the model's ability to operate with low computational budget. Code/project link: https://github.com/JinPLu/VeloxSeg
论文ICLR 2026 Poster2026 年trustworthy medical AI 用谱熵正则重新思考医学图像分割中的模型校准
ICLR 2026 Poster accepted paper at ICLR 2026. Deep neural networks for medical image segmentation often produce overconfident predictions, posing clinical risks due to miscalibrated uncertainty estimates. In this work, we rethink model calibration from a frequency-domain perspective and identify two critical factors causing miscalibration: spectral bias, where models overemphasize low-frequency components, and confidence saturation, which suppresses overall power spectral density in confidence maps. To address these challenges, we propose a novel frequency-aware calibration framework integrating spectral entropy regularization and power spectral smoothing. The spectral entropy term promotes a balanced frequency spectrum and enhances overall spectral power, enabling better modeling of high-frequency boundary and low-frequency structural uncertainty.
论文ICLR 2026 Poster2026 年trustworthy medical AI COMPASS:医学分割指标的鲁棒特征保形预测
ICLR 2026 Poster accepted paper at ICLR 2026. In clinical applications, the utility of segmentation models is often based on the accuracy of derived downstream metrics such as organ size, rather than by the pixel-level accuracy of the segmentation masks themselves. Thus, uncertainty quantification for such metrics is crucial for decision-making. Conformal prediction (CP) is a popular framework to derive such principled uncertainty guarantees, but applying CP naively to the final scalar metric is inefficient because it treats the complex, non-linear segmentation-to-metric pipeline as a black box. We introduce COMPASS, a practical framework that generates efficient, metric-based CP intervals for image segmentation models by leveraging the inductive biases of their underlying deep neural networks.
论文ICLR 2026 Poster2026 年clinical NLP 迈向医学图像分割中的文本-掩膜一致性
ICLR 2026 Poster accepted paper at ICLR 2026. Vision-language models for medical image segmentation often produce masks that conflict with the accompanying text, especially under multi-site/multi-lesion descriptions. We trace this failure to two factors: (i) highly templated and repetitive clinical language causes one-to-one hard contrastive learning to yield numerous false negatives, weakening cross-modal alignment; and (ii) predominantly vision-driven, one-way cross-attention lacks a language-dominant, spatially aware pathway, hindering effective injection of textual semantics into the spatial visual domain. To this end, we propose Consistency-enhanced Two-stage Segmentation (C2Seg). In the pretraining stage, Cluster-aware Contrastive Learning uses a frozen strong baseline to construct an intra-batch text similarity matrix as soft labels, thereby alleviating false negative conflicts and producing more discriminative visual representations.
论文ICLR 2026 Poster2026 年医学影像 你指点,我学习:交互式分割模型在线适配医学影像分布偏移
ICLR 2026 Poster accepted paper at ICLR 2026. Interactive segmentation uses real-time user inputs, such as mouse clicks, to iteratively refine model predictions. Although not originally designed to address distribution shifts, this paradigm naturally lends itself to such challenges. In medical imaging, where distribution shifts are common, interactive methods can use user inputs to guide models towards improved predictions. Moreover, once a model is deployed, user corrections can be used to adapt the network parameters to the new data distribution, mitigating distribution shift. Based on these insights, we aim to develop a practical, effective method for improving the adaptive capabilities of interactive segmentation models to new data distributions in medical imaging. Code/project link: https://github.com/WenTXuL/OAIMS
论文ICLR 2026 Poster2026 年trustworthy medical AI Cross-Timestep:用于医学分割的跨时序记忆 LSTM 与自适应先验解码 3D 扩散模型
ICLR 2026 Poster accepted paper at ICLR 2026. Diffusion models have recently demonstrated significant robustness in medical image segmentation, effectively accommodating variations across different imaging styles. However, their applications remain limited due to: (i) current successes being primarily confined to 2D segmentation tasks—we observe that diffusion models tend to collapse at the early stage when applied to 3D medical tasks; and (ii) the inherently isolated iteration along timesteps during training and inference. To tackle these limitations, we propose a novel framework named Cross-Timestep, which incorporates two key innovations: an Adaptive Priori Decoding Strategy (APDS) and a trans-temporal memory LSTM (tLSTM) mechanism. (i) The APDS provides prior guidance during the diffusion process by employing a Priori Decoder(PD) that focuses solely on the conditional branch, successfully stabilizing the reverse diffusion process.
论文ICLR 2026 Poster2026 年medical LLM agent K-Prism:知识引导与提示融合的通用医学图像分割模型
ICLR 2026 Poster accepted paper at ICLR 2026. Medical image segmentation is fundamental to clinical decision-making, yet existing models remain fragmented. They are usually trained on single knowledge sources and specific to individual tasks, modalities, or organs. This fragmentation contrasts sharply with clinical practice, where experts seamlessly integrate diverse knowledge: anatomical priors from training, exemplar-based reasoning from reference cases, and iterative refinement through real-time interaction. We present $\textbf{K-Prism}$, a unified segmentation framework that mirrors this clinical flexibility by systematically integrating three knowledge paradigms: (i) $\textit{semantic priors}$ learned from annotated datasets, (ii) $\textit{in-context knowledge}$ from few-shot reference examples, and (iii) $\textit{interactive feedback}$ from user inputs like clicks or scribbles. Code/project link: https://github.com/bangwayne/K-Prism
论文Nature Communications2024 年医学影像 医学图像中的 Segment Anything
MedSAM adapts the Segment Anything paradigm to medical image segmentation and reports broad evaluation across imaging modalities.
数据资源abdominal CT and MRI with multi-organ annotationsabdominal multi-organ segmentation benchmarkAMOS 2022 challenge benchmark; see official Grand Challenge page申请访问 AMOS 腹部多器官分割基准
AMOS is an abdominal multi-organ segmentation benchmark with CT and MRI cases for evaluating versatile medical image segmentation models. It supports abdominal organ segmentation, modality-general segmentation, and benchmarking of robust 3D segmentation methods.
数据资源CT/MRI分割基准10 segmentation tasks开放访问 Medical Segmentation Decathlon 医学分割十项全能
Legacy multi-task biomedical image segmentation benchmark retained as a reference; newer segmentation benchmarks are listed above it.
数据资源医学影像分割基准IMed-361M / IMIS-Bench开放访问 IMed-361M / IMIS-Bench 交互式医学图像分割基准
Interactive medical image segmentation benchmark and baseline from CVPR 2025, covering multiple modalities, organs, and target structures.