论文ICLR 2026 Poster2026 年trustworthy medical AI 先验感知与上下文引导的主动概率子采样分组
ICLR 2026 Poster accepted paper at ICLR 2026. Subsampling significantly reduces the number of measurements, thereby streamlining data processing and transfer overhead, and shortening acquisition time across diverse real-world applications. The recently introduced Active Deep Probabilistic Subsampling (A-DPS) approach jointly optimizes both the subsampling pattern and the downstream task model, enabling instance- and subject-specific sampling trajectories and effective adaptation to new data at inference time. However, this approach does not fully leverage valuable dataset priors and relies on top-1 sampling, which can impede the optimization process. Herein, we enhance A-DPS by integrating a deterministic (fixed) prior-informed sampling pattern derived from the training dataset, along with group-based sampling via top-k sampling, to achieve more robust optimization—method we call Prior-aware and context-guided Group-based Active DPS (PGA-DPS).
论文ICLR 2026 Poster2026 年医学影像 Disco:通过邻接感知协同着色实现密集重叠细胞实例分割
ICLR 2026 Poster accepted paper at ICLR 2026. Accurate cell instance segmentation is foundational for digital pathology analysis. Existing methods based on contour detection and distance mapping still face significant challenges in processing complex and dense cellular regions. Graph coloring-based methods provide a new paradigm for this task, yet the effectiveness of this paradigm in real-world scenarios with dense overlaps and complex topologies has not been verified. Addressing this issue, we release a large-scale dataset GBC-FS 2025, which contains highly complex and dense sub-cellular nuclear arrangements. We conduct the first systematic analysis of the chromatic properties of cell adjacency graphs across four diverse datasets and reveal an important discovery: most real-world cell graphs are non-bipartite, with a high prevalence of odd-length cycles (predominantly triangles).
论文ICLR 2026 Poster2026 年trustworthy medical AI 序贯信息瓶颈融合:迈向鲁棒且可泛化的多模态脑肿瘤分割
ICLR 2026 Poster accepted paper at ICLR 2026. Brain tumor segmentation in multi-modal MRIs poses significant challenges when one or more modalities are missing. Recent approaches commonly employ parallel fusion strategies; however, these methods often risk losing crucial shared information across modalities, which can degrade segmentation performance. In this paper, we advocate leveraging sequential information bottleneck fusion to effectively preserve shared information across modalities. From an information-theoretic perspective, sequential fusion not only produces more robust fused representations in missing-data scenarios but also achieves a tighter generalization upper bound compared to parallel fusion approaches.
论文ICLR 2026 Poster2026 年trustworthy medical AI Dual-Kernel Adapter:拓展数据受限医学图像分析的空间视野
ICLR 2026 Poster accepted paper at ICLR 2026. Adapters have become a widely adopted strategy for efficient fine-tuning of foundation models, particularly in resource-constrained settings. However, their performance under extreme data scarcity—common in medical imaging due to high annotation costs, privacy regulations, and fragmented datasets—remains underexplored. In this work, we present the first comprehensive study of adapter-based fine-tuning for vision foundation models in low-data medical imaging scenarios. We find that, contrary to their promise, conventional Adapters can degrade performance under severe data constraints, performing even worse than simple linear probing when trained on less than 1\% of the corresponding training data.
论文ICLR 2026 Poster2026 年trustworthy medical AI Johnson-Lindenstrauss 引理引导的高效 3D 医学分割网络
ICLR 2026 Poster accepted paper at ICLR 2026. Lightweight 3D medical image segmentation remains constrained by a fundamental "efficiency / robustness conflict", particularly when processing complex anatomical structures and heterogeneous modalities. In this paper, we study how to redesign the framework based on the characteristics of high-dimensional 3D images, and explore data synergy to overcome the fragile representation of lightweight methods. Our approach, VeloxSeg, begins with a deployable and extensible dual-stream CNN-Transformer architecture composed of Paired Window Attention (PWA) and Johnson-Lindenstrauss lemma-guided convolution (JLC). For each 3D image, we invoke a "glance-and-focus" principle, where PWA rapidly retrieves multi-scale information, and JLC ensures robust local feature extraction with minimal parameters, significantly enhancing the model's ability to operate with low computational budget. Code/project link: https://github.com/JinPLu/VeloxSeg
论文ICLR 2026 Poster2026 年trustworthy medical AI 用谱熵正则重新思考医学图像分割中的模型校准
ICLR 2026 Poster accepted paper at ICLR 2026. Deep neural networks for medical image segmentation often produce overconfident predictions, posing clinical risks due to miscalibrated uncertainty estimates. In this work, we rethink model calibration from a frequency-domain perspective and identify two critical factors causing miscalibration: spectral bias, where models overemphasize low-frequency components, and confidence saturation, which suppresses overall power spectral density in confidence maps. To address these challenges, we propose a novel frequency-aware calibration framework integrating spectral entropy regularization and power spectral smoothing. The spectral entropy term promotes a balanced frequency spectrum and enhances overall spectral power, enabling better modeling of high-frequency boundary and low-frequency structural uncertainty.
论文ICLR 2026 Poster2026 年trustworthy medical AI COMPASS:医学分割指标的鲁棒特征保形预测
ICLR 2026 Poster accepted paper at ICLR 2026. In clinical applications, the utility of segmentation models is often based on the accuracy of derived downstream metrics such as organ size, rather than by the pixel-level accuracy of the segmentation masks themselves. Thus, uncertainty quantification for such metrics is crucial for decision-making. Conformal prediction (CP) is a popular framework to derive such principled uncertainty guarantees, but applying CP naively to the final scalar metric is inefficient because it treats the complex, non-linear segmentation-to-metric pipeline as a black box. We introduce COMPASS, a practical framework that generates efficient, metric-based CP intervals for image segmentation models by leveraging the inductive biases of their underlying deep neural networks.
论文ICLR 2026 Poster2026 年surgical/interventional AI WavePolyp:基于层级小波特征聚合与帧间差异感知的视频息肉分割
ICLR 2026 Poster accepted paper at ICLR 2026. Automatic polyp segmentation from colonoscopy videos is a crucial technique that assists clinicians in improving the accuracy and efficiency of diagnosis, preventing polyps from developing into cancer. However, video polyp segmentation (VPS) is a challenging task due to (1) the significant inter-frame divergence in videos, (2) the high camouflage of polyps in normal colon structures and (3) the clinical requirement of real-time performance. In this paper, we propose a novel segmentation network, WavePolyp, which consists of two innovative components: a hierarchical wavelet-based feature aggregation (HWFA) module and inter-frame divergence perception (IDP) blocks. Specifically, HWFA excavates and amplifies discriminative information from high-frequency and low-frequency features decomposed by wavelet transform, hierarchically aggregating them into refined spatial representations within each frame. Code/project link: https://github.com/FishballZhang/WavePolyp
论文ICLR 2026 Poster2026 年clinical NLP 迈向医学图像分割中的文本-掩膜一致性
ICLR 2026 Poster accepted paper at ICLR 2026. Vision-language models for medical image segmentation often produce masks that conflict with the accompanying text, especially under multi-site/multi-lesion descriptions. We trace this failure to two factors: (i) highly templated and repetitive clinical language causes one-to-one hard contrastive learning to yield numerous false negatives, weakening cross-modal alignment; and (ii) predominantly vision-driven, one-way cross-attention lacks a language-dominant, spatially aware pathway, hindering effective injection of textual semantics into the spatial visual domain. To this end, we propose Consistency-enhanced Two-stage Segmentation (C2Seg). In the pretraining stage, Cluster-aware Contrastive Learning uses a frozen strong baseline to construct an intra-batch text similarity matrix as soft labels, thereby alleviating false negative conflicts and producing more discriminative visual representations.
论文ICLR 2026 Poster2026 年trustworthy medical AI PathChat-SegR1:通过 SO-GRPO 实现病理推理分割
ICLR 2026 Poster accepted paper at ICLR 2026. Segmentation in pathology image requires handling out-of-domain tissue morphologies and new pathologies beyond training distributions, where traditional closed-set segmentation approaches fail to generalize. Reasoning segmentation enables zero-shot generalization via prompting with text queries. However, existing reasoning segmentation models face three barriers when applied to pathology: (1) the vision encoder lack pathology-specific knowledge and robustness to staining variations, (2) the large language model (LLM) backbone for reasoning fails to identify whether it has gathered sufficient semantic context to trigger the segmentation output, and (3) no reasoning segmentation benchmarks and datasets exist for pathology analysis. Consequently, we introduce PathChat-SegR1, a reasoning segmentation model built upon pathology-specific vision encoders trained with a novel stain-invariant self-distillation for robust pathology image representations.
论文ICLR 2026 Poster2026 年医学影像 你指点,我学习:交互式分割模型在线适配医学影像分布偏移
ICLR 2026 Poster accepted paper at ICLR 2026. Interactive segmentation uses real-time user inputs, such as mouse clicks, to iteratively refine model predictions. Although not originally designed to address distribution shifts, this paradigm naturally lends itself to such challenges. In medical imaging, where distribution shifts are common, interactive methods can use user inputs to guide models towards improved predictions. Moreover, once a model is deployed, user corrections can be used to adapt the network parameters to the new data distribution, mitigating distribution shift. Based on these insights, we aim to develop a practical, effective method for improving the adaptive capabilities of interactive segmentation models to new data distributions in medical imaging. Code/project link: https://github.com/WenTXuL/OAIMS
论文ICLR 2026 Poster2026 年trustworthy medical AI Cross-Timestep:用于医学分割的跨时序记忆 LSTM 与自适应先验解码 3D 扩散模型
ICLR 2026 Poster accepted paper at ICLR 2026. Diffusion models have recently demonstrated significant robustness in medical image segmentation, effectively accommodating variations across different imaging styles. However, their applications remain limited due to: (i) current successes being primarily confined to 2D segmentation tasks—we observe that diffusion models tend to collapse at the early stage when applied to 3D medical tasks; and (ii) the inherently isolated iteration along timesteps during training and inference. To tackle these limitations, we propose a novel framework named Cross-Timestep, which incorporates two key innovations: an Adaptive Priori Decoding Strategy (APDS) and a trans-temporal memory LSTM (tLSTM) mechanism. (i) The APDS provides prior guidance during the diffusion process by employing a Priori Decoder(PD) that focuses solely on the conditional branch, successfully stabilizing the reverse diffusion process.
论文ICLR 2026 Poster2026 年医学影像 MedGMAE:面向医学体数据表征学习的 Gaussian 掩码自编码器
ICLR 2026 Poster accepted paper at ICLR 2026. Self-supervised pre-training has emerged as a critical paradigm for learning transferable representations from unlabeled medical volumetric data. Masked autoencoder based methods have garnered significant attention, yet their application to volumetric medical image faces fundamental limitations from the discrete voxel-level reconstruction objective, which neglects comprehensive anatomical structure continuity. To address this challenge, We propose MedGMAE, a novel framework that replaces traditional voxel reconstruction with 3D Gaussian primitives reconstruction as new perspectives on representation learning. Our approach learns to predict complete sets of 3D Gaussian parameters as semantic abstractions to represent the entire 3D volume, from sparse visible image patches. Code/project link: https://github.com/windrise/MedGMAE; https://anonymous.4open.science/r/MedGMAE-EC8F/
论文ICLR 2026 Poster2026 年surgical/interventional AI HFSTI-Net:视频息肉分割的层级频率-空间-时间交互
ICLR 2026 Poster accepted paper at ICLR 2026. Automatic video polyp segmentation (VPS) is crucial for preventing and treating colorectal cancer by ensuring accurate identification of polyps in colonoscopy examinations. However, its clinical application is hampered by two key challenges: shape collapse, which compromises structural integrity, and episodic amnesia, which causes instability in challenging video sequences. To address these challenges, we present a novel video segmentation network, \emph{HFSTI-Net}, which integrates global perception with spatiotemporal consistency in spatial, temporal, and frequency domains. Specifically, to address shape collapse under low contrast or visual ambiguity, we design a Hierarchical Frequency-spatial Interaction (HFSI) module that fuses spatial and frequency cues for fine-grained boundary localization. Code/project link: https://github.com/Yuanqin-He/HFSTI-Net
论文ICLR 2026 Poster2026 年clinical prediction 泛癌筛查中的扫视-聚焦强化机制
ICLR 2026 Poster accepted paper at ICLR 2026. Pan-cancer screening in large-scale CT scans remains challenging for existing AI methods, primarily due to the difficulty of localizing diverse types of tiny lesions in large CT volumes. The extreme foreground-background imbalance significantly hinders models from focusing on diseased regions, while redundant focus on healthy regions not only decreases the efficiency but also increases false positives. Inspired by radiologists' glance and focus diagnostic strategy, we introduce GF-Screen, a Glance and Focus reinforcement learning framework for pan-cancer screening. GF-Screen employs a Glance model to localize the diseased regions and a Focus model to precisely segment the lesions, where segmentation results of the Focus model are leveraged to reward the Glance model via Reinforcement Learning (RL). Code/project link: https://github.com/Luffy03/GF-Screen
论文ICLR 2026 Poster2026 年medical LLM agent K-Prism:知识引导与提示融合的通用医学图像分割模型
ICLR 2026 Poster accepted paper at ICLR 2026. Medical image segmentation is fundamental to clinical decision-making, yet existing models remain fragmented. They are usually trained on single knowledge sources and specific to individual tasks, modalities, or organs. This fragmentation contrasts sharply with clinical practice, where experts seamlessly integrate diverse knowledge: anatomical priors from training, exemplar-based reasoning from reference cases, and iterative refinement through real-time interaction. We present $\textbf{K-Prism}$, a unified segmentation framework that mirrors this clinical flexibility by systematically integrating three knowledge paradigms: (i) $\textit{semantic priors}$ learned from annotated datasets, (ii) $\textit{in-context knowledge}$ from few-shot reference examples, and (iii) $\textit{interactive feedback}$ from user inputs like clicks or scribbles. Code/project link: https://github.com/bangwayne/K-Prism
论文ICLR 2026 Poster2026 年trustworthy medical AI CARE:面向多模态医学推理临床问责的证据扎根 Agent 框架
ICLR 2026 Poster accepted paper at ICLR 2026. Large visual language models (VLMs) have shown strong multi-modal medical reasoning ability, but most operate as end-to-end black boxes, diverging from clinicians’ evidence-based, staged workflows and hindering clinical accountability. Complementarily, expert visual grounding models can accurately localize regions of interest (ROIs), providing explicit, reliable evidence that improves both reasoning accuracy and trust. In this paper, we introduce **CARE**, advancing **C**linical **A**ccountability in multi-modal medical **R**easoning with an **E**vidence-grounded agentic framework. Unlike existing approaches that couple grounding and reasoning within a single generalist model, CARE decomposes the task into coordinated sub-modules to reduce shortcut learning and hallucination: a compact VLM proposes relevant medical entities; an expert entity-referring segmentation model produces pixel-level ROI evidence; and a grounded VLM reasons over the full image augmented by ROI hints.
论文ICLR 2026 Poster2026 年医学影像 建模像素级自监督嵌入密度用于医学 CT 无监督病理分割
ICLR 2026 Poster accepted paper at ICLR 2026. Accurate detection of all pathological findings in 3D medical images remains a significant challenge, as supervised models are limited to detecting only the few pathology classes annotated in existing datasets. To address this, we frame pathology detection as an unsupervised visual anomaly segmentation (UVAS) problem, leveraging the inherent rarity of pathological patterns compared to healthy ones. We enhance the existing density-based UVAS framework with two key innovations: (1) dense self-supervised learning for feature extraction, eliminating the need for supervised pretraining, and (2) learned, masking-invariant dense features as conditioning variables, replacing hand-crafted positional encodings. Trained on over 30,000 unlabeled 3D CT volumes, our fully self-supervised model, Screener, outperforms existing UVAS methods on four large-scale test datasets comprising 1,820 scans with diverse pathologies. Code/project link: https://github.com/mishgon/screener; https://anonymous.4open.science/r/screener-35EE/
论文Nature Communications2024 年医学影像 医学图像中的 Segment Anything
MedSAM adapts the Segment Anything paradigm to medical image segmentation and reports broad evaluation across imaging modalities.
数据资源abdominal CT and MRI with multi-organ annotationsabdominal multi-organ segmentation benchmarkAMOS 2022 challenge benchmark; see official Grand Challenge page申请访问 AMOS 腹部多器官分割基准
AMOS is an abdominal multi-organ segmentation benchmark with CT and MRI cases for evaluating versatile medical image segmentation models. It supports abdominal organ segmentation, modality-general segmentation, and benchmarking of robust 3D segmentation methods.
数据资源retinal fundus photographs with glaucoma and structure annotationsophthalmology fundus image challenge datasetREFUGE challenge dataset; official splits described on Grand Challenge申请访问 REFUGE 视网膜眼底青光眼挑战数据集
REFUGE is a retinal fundus imaging challenge dataset for glaucoma assessment. It supports glaucoma classification, optic disc and cup segmentation, fovea localization, and fair comparison of ophthalmology AI methods on color fundus photographs.
数据资源cine cardiac MRI with segmentation labelscardiac MRI segmentation datasetACDC challenge dataset; see official database page申请访问 ACDC 自动心脏诊断挑战数据集
ACDC is a cardiac MRI dataset for automated cardiac diagnosis and segmentation. It supports left and right ventricular segmentation, myocardium segmentation, cardiac function quantification, and evaluation of robust cardiac image analysis methods.
数据资源brain MRI with demographic and clinical variablesbrain MRI and neuroimaging dataset collectionOASIS cross-sectional and longitudinal releases; see official site开放访问 OASIS 脑 MRI 与神经影像数据集
OASIS provides open-access neuroimaging datasets for studying normal aging, dementia, and brain structure. It is useful for brain MRI segmentation, age prediction, dementia classification, longitudinal modeling, and neuroimaging method benchmarking.
数据资源cardiac ultrasound videos with functional annotationsechocardiography video datasetLarge echocardiography video dataset; see official site申请访问 EchoNet-Dynamic 心脏超声视频数据集
EchoNet-Dynamic is a cardiac ultrasound video dataset with expert annotations for left ventricular function. It is used for echocardiography video understanding, ejection fraction estimation, cardiac segmentation, and clinical video AI research.
数据资源dermoscopic and clinical skin lesion imagesdermatology image archiveLarge public ISIC dermatology image archive开放访问 ISIC Archive 皮肤病学图像数据集
The ISIC Archive is a large public dermatology image repository for skin lesion analysis. It is widely used for melanoma classification, lesion segmentation, dermoscopic image retrieval, bias and domain shift analysis, and clinical imaging benchmark development.
数据资源multimodal brain MRI with tumor annotationsbrain tumor MRI segmentation challenge datasetBraTS 2024 challenge dataset; see Synapse project申请访问 BraTS 2024 脑肿瘤分割挑战数据集
BraTS 2024 provides multimodal brain MRI data and expert annotations for brain tumor segmentation and related tumor subregion analysis. It is a major benchmark for glioma segmentation, radiology AI, and robust multimodal MRI segmentation methods.
数据资源abdominal CT with kidney and tumor annotationskidney tumor CT segmentation datasetTCIA C4KC-KiTS collection; see collection page开放访问 C4KC-KiTS 肾肿瘤分割集合
C4KC-KiTS is a TCIA imaging collection associated with kidney and kidney tumor segmentation benchmarks. It supports kidney segmentation, renal tumor segmentation, surgical planning research, and evaluation of abdominal CT segmentation models.
数据资源thoracic CT images with nodule annotationslung CT nodule datasetTCIA LIDC-IDRI collection开放访问 LIDC-IDRI 肺部 CT 结节数据集
LIDC-IDRI is a lung CT dataset with thoracic CT scans and expert nodule annotations. It is a classic benchmark for lung nodule detection, segmentation, malignancy characterization, radiomics, and computer-aided diagnosis research.
数据资源CT/MRI分割基准10 segmentation tasks开放访问 Medical Segmentation Decathlon 医学分割十项全能
Legacy multi-task biomedical image segmentation benchmark retained as a reference; newer segmentation benchmarks are listed above it.
数据资源Biomedical imagesTool/modelFoundation model and code开放访问 BiomedParse 生物医学图像解析基础模型
Foundation model and toolkit for all-in-one biomedical image parsing across recognition, detection, and segmentation tasks.
数据资源医学影像分割基准IMed-361M / IMIS-Bench开放访问 IMed-361M / IMIS-Bench 交互式医学图像分割基准
Interactive medical image segmentation benchmark and baseline from CVPR 2025, covering multiple modalities, organs, and target structures.
技术竞赛Open soonself-supervised 3D microscopy image segmentation3D light-sheet microscopy截止 北京时间 2026-09-25 3D 光片显微图像分割自监督学习挑战
Grand Challenge official API lists this medical AI challenge with status OPEN_SOON. SSL for 3D light-sheet microscopy image segmentation Start date: 2026-06-20. End/deadline date: 2026-09-25.
技术竞赛Open soonaneurysm image analysisvascular/neurovascular medical imaging开始 北京时间 2026-08-14 TopAneu 2026
Grand Challenge official API lists this medical AI challenge with status OPEN_SOON. Multimodal Vessel-Specific Intracranial Aneurysm Classification and Segmentation Challenge Start date: 2026-08-14.
技术竞赛Open soonischemic stroke lesion segmentationstroke lesion brain imaging截止 北京时间 2026-08-15 2026 缺血性卒中病灶分割挑战
Grand Challenge official API lists this medical AI challenge with status OPEN_SOON. Ischemic Stroke Lesion Segmentation Challenge 2026 Start date: 2026-06-15. End/deadline date: 2026-08-15.
技术竞赛Open soonperipelvic fracture segmentation and reduction planningpelvic fracture CT imaging截止 北京时间 2026-08-19 骨盆周围骨折分割与复位规划挑战
Grand Challenge official API lists this medical AI challenge with status OPEN_SOON. Peripelvic fractures are severe injuries with high disability and mortality rates. The PENGWIN 2026 Challenge aims to advance state-of-the-art techniques for intelligent surgical planning in 3D CT scans. It consists of three tasks: fully automated peripelvic fracture segmentation (Task 1), interactive segmentation (Task 2), and fracture reduction planning (Task 3). The dataset features 500 clinical cases with expert annotations and 16,000 simulated fracture cases to support the training of data-driven reduction models. Start date: 2026-05-10. End/deadline date: 2026-08-19.
技术竞赛Open soonhead and neck tumor lesion segmentation, staging, and prognosishead and neck oncology imaging截止 北京时间 2026-07-24 HECKTOR:头颈部肿瘤病灶分割、分期与预后挑战
Grand Challenge official API lists this medical AI challenge with status OPEN_SOON. HEad and neCK TumOR Lesion Segmentation, Staging and Prognosis Start date: 2026-05-31. End/deadline date: 2026-07-24.
技术竞赛Open soonwhole-body PET/CT automated lesion segmentationwhole-body PET/CT截止 北京时间 2026-09-25 autoPET V:全身 PET/CT 自动病灶分割挑战
Grand Challenge official API lists this medical AI challenge with status OPEN_SOON. Automated Lesion Segmentation in Whole-Body PET/CT Start date: 2026-05-03. End/deadline date: 2026-09-25.
技术竞赛Openbreast cancer histopathology semantic segmentationH&E whole-slide histopathology images开始 北京时间 2026-05-03 BEETLE 乳腺癌组织病理分割挑战
Grand Challenge official API lists this medical AI challenge with status OPEN. BEETLE is a multicenter, multiscanner benchmark for breast cancer histopathology segmentation. It focuses on multiclass semantic segmentation of hematoxylin and eosin (H&E)-stained whole-slide images (WSIs) into four tissue categories: invasive epithelium, non-invasive epithelium, necrosis, and other. The evaluation set comprises 170 densely annotated regions from 54 WSIs, covering all molecular subtypes and histological grades, thereby capturing much of the morphological heterogeneity seen in clinical practice. BEETLE provides a standardized resource for benchmarking breast cancer segmentation models, supporting the development of robust, generalizable algorithms for large-scale biomarke...
技术竞赛Openischemic stroke lesion segmentationbrain imaging for stroke lesion segmentation截止 北京时间 2026-12-31 2024 缺血性卒中病灶分割挑战
Grand Challenge official API lists this medical AI challenge with status OPEN. Ischemic Stroke Lesion Segmentation Challenge 2024 Start date: 2026-03-30. End/deadline date: 2026-12-31.
技术竞赛Submission deadline 2026-08-01 19:59 BeijingEducation challengeMedical image computing education截止 北京时间 2026-08-01 19:59 MICCAI 2026 医学影像计算教育挑战
MICCAI Student Board educational challenge for tutorial-style submissions around medical image computing education.