论文ICLR 2026 Poster2026 年trustworthy medical AI 面向垂直联邦学习的隐私保障标签遗忘:无需披露的少样本遗忘
ICLR 2026 Poster accepted paper at ICLR 2026. This paper addresses the critical challenge of unlearning in Vertical Federated Learning (VFL), a setting that has received far less attention than its horizontal counterpart. Specifically, we propose the first method tailored to *label unlearning* in VFL, where labels play a dual role as both essential inputs and sensitive information. To this end, we employ a representation-level manifold mixup mechanism to generate synthetic embeddings for both unlearned and retained samples. This is to provide richer signals for the subsequent gradient-based label forgetting and recovery steps. These augmented embeddings are then subjected to gradient-based label forgetting, effectively removing the associated label information from the model. Code/project link: https://github.com/bryanhx/Towards-Privacy-Guaranteed-Label-Unlearning-in-Vertical-Federated-Learning
论文ICLR 2026 Oral2026 年medical multimodal 面向多模态 GigaVoxel 图像配准的可扩展分布式框架
ICLR 2026 Oral accepted paper at ICLR 2026. In this work, we propose FFDP, a set of IO-aware non-GEMM fused kernels supplemented with a distributed framework for image registration at unprecedented scales. Image registration is an inverse problem fundamental to biomedical and life sciences, but algorithms have not scaled in tandem with image acquisition capabilities. Our framework complements existing model parallelism techniques proposed for large-scale transformer training by optimizing non-GEMM bottlenecks and enabling convolution-aware tensor sharding. We demonstrate unprecedented capabilities by performing multimodal registration of a 100μm ex-vivo human brain MRI volume at native resolution – an inverse problem more than 570× larger than a standard clinical datum in about a minute using only 8 A6000 GPUs.
论文ICLR 2026 Poster2026 年clinical prediction 视频理解中的人脑:动态专家混合模型
ICLR 2026 Poster accepted paper at ICLR 2026. The human brain is the most efficient and versatile system for processing dynamic visual input. By comparing representations from deep video models to brain activity, we can gain insights into mechanistic solutions for effective video processing, important to better understand the brain and to build better models. Current works in model-brain alignment primarily focus on fMRI measurements, leaving open questions about fine-grained dynamic processing. Here, we introduce the first large-scale model benchmarking on alignment to dynamic electroencephalography (EEG) recordings of short natural videos. We analyze 100+ models across the axes of temporal integration, classification task, architecture, and pretraining, using our proposed Cross-Temporal Representational Similarity Analysis (CT-RSA) which matches the best time-unfolded model features to dynamically evolving brain responses, distilling $10^7$ alignment scores.
论文ICLR 2026 Poster2026 年医学影像 MnemoDyn:从 4 万条 fMRI 序列学习静息态动力学
ICLR 2026 Poster accepted paper at ICLR 2026. We present a dynamical-systems based model for resting-state functional magnetic resonance imaging (rs-fMRI), trained on a dataset of roughly $40$K rs-fMRI sequences covering a wide variety of public and available-by-permission datasets. While most existing proposals use transformer backbones, we utilize multi-resolution temporal modeling of the dynamics across parcellated brain regions. We show that MnemoDyn is compute efficient and generalizes very well across diverse populations and scanning protocols. When benchmarked against current state-of-the-art transformer-based approaches, MnemoDyn consistently delivers superior reconstruction quality.
论文ICLR 2026 Poster2026 年医学影像 受认知过程启发的主体无关脑视觉解码架构
ICLR 2026 Poster accepted paper at ICLR 2026. Subject-agnostic brain decoding, which aims to reconstruct continuous visual experiences from fMRI without subject-specific training, holds great potential for clinical applications. However, this direction remains underexplored due to challenges in cross-subject generalization and the complex nature of brain signals. In this work, we propose Visual Cortex Flow Architecture (VCFlow), a novel hierarchical decoding framework that explicitly models the ventral-dorsal architecture of the human visual system to learn multi-dimensional representations. By disentangling and leveraging features from early visual cortex, ventral, and dorsal streams, VCFlow captures diverse and complementary cognitive information essential for visual reconstruction.
论文ICLR 2026 Poster2026 年trustworthy medical AI 先验感知与上下文引导的主动概率子采样分组
ICLR 2026 Poster accepted paper at ICLR 2026. Subsampling significantly reduces the number of measurements, thereby streamlining data processing and transfer overhead, and shortening acquisition time across diverse real-world applications. The recently introduced Active Deep Probabilistic Subsampling (A-DPS) approach jointly optimizes both the subsampling pattern and the downstream task model, enabling instance- and subject-specific sampling trajectories and effective adaptation to new data at inference time. However, this approach does not fully leverage valuable dataset priors and relies on top-1 sampling, which can impede the optimization process. Herein, we enhance A-DPS by integrating a deterministic (fixed) prior-informed sampling pattern derived from the training dataset, along with group-based sampling via top-k sampling, to achieve more robust optimization—method we call Prior-aware and context-guided Group-based Active DPS (PGA-DPS).
论文ICLR 2026 Poster2026 年trustworthy medical AI Brain-Semantoks:用自蒸馏基础模型学习脑动力学语义 token
ICLR 2026 Poster accepted paper at ICLR 2026. The development of foundation models for functional magnetic resonance imaging (fMRI) time series holds significant promise for predicting phenotypes related to disease and cognition. Current models, however, are often trained using a mask-and-reconstruct objective on small brain regions. This focus on low-level information leads to representations that are sensitive to noise and temporal fluctuations, necessitating extensive fine-tuning for downstream tasks. We introduce Brain-Semantoks, a self-supervised framework designed specifically to learn abstract representations of brain dynamics. Its architecture is built on two core innovations: a semantic tokenizer that aggregates noisy regional signals into robust tokens representing functional networks, and a self-distillation objective that enforces representational stability across time.
论文ICLR 2026 Poster2026 年trustworthy medical AI 序贯信息瓶颈融合:迈向鲁棒且可泛化的多模态脑肿瘤分割
ICLR 2026 Poster accepted paper at ICLR 2026. Brain tumor segmentation in multi-modal MRIs poses significant challenges when one or more modalities are missing. Recent approaches commonly employ parallel fusion strategies; however, these methods often risk losing crucial shared information across modalities, which can degrade segmentation performance. In this paper, we advocate leveraging sequential information bottleneck fusion to effectively preserve shared information across modalities. From an information-theoretic perspective, sequential fusion not only produces more robust fused representations in missing-data scenarios but also achieves a tighter generalization upper bound compared to parallel fusion approaches.
论文ICLR 2026 Poster2026 年医学影像 脑图基础模型:跨多图谱与疾病的预训练和提示微调
ICLR 2026 Poster accepted paper at ICLR 2026. As large language models (LLMs) continue to revolutionize AI research, there is a growing interest in building large-scale brain foundation models to advance neuroscience. While most existing brain foundation models are pre-trained on time-series signals or connectome features, we propose a novel graph-based pre-training paradigm for constructing a brain graph foundation model. In this paper, we introduce the Brain Graph Foundation Model, termed BrainGFM, a unified framework that leverages graph contrastive learning and graph masked autoencoders for large-scale fMRI-based pre-training. BrainGFM is pre-trained on a diverse mixture of brain atlases with varying parcellations, significantly expanding the pre-training corpus and enhancing the model’s ability to generalize across heterogeneous fMRI-derived brain representations. Code/project link: https://github.com/weixinxu666/BrainGFM
论文ICLR 2026 Poster2026 年trustworthy medical AI UltraGauss:3D 超声体数据的超快速 Gaussian 重建
ICLR 2026 Poster accepted paper at ICLR 2026. Ultrasound imaging is widely used due to its safety, affordability, and real-time capabilities, but its 2D interpretation is highly operator-dependent, leading to variability and increased cognitive demand. We present $\textbf{UltraGauss}$: an ultrasound-specific Gaussian Splatting framework that serves as an efficient approximation to acoustic image formation. Unlike projection-based splatting, UltraGauss renders by $\textit{probe-plane intersection}$ with in-plane aggregation, aligning with plane-based echo sampling while remaining fast and memory-efficient. A stable parameterisation and compute-aware GPU rasterisation make this method practical at scale. Code/project link: https://www.robots.ox.ac.uk/~vgg/research/UltraGauss/
论文ICLR 2026 Poster2026 年clinical prediction 基于脉冲的数字大脑:脑活动分析的新型基础模型
ICLR 2026 Poster accepted paper at ICLR 2026. Modeling the temporal dynamics of the human brain remains a core challenge in computational neuroscience and artificial intelligence. Traditional methods often ignore the biological spike characteristics of brain activity and find it difficult to reveal the dynamic dependencies and causal interactions between brain regions, limiting their effectiveness in brain function research and clinical applications. To address this issue, we propose a Spike-based Digital Brain (Spike-DB), a novel fundamental model that introduces the spike computing paradigm into brain time series modeling. Spike-DB encodes fMRI signals as spike trains and learns the temporal driving relationships between anchor and target regions to achieve high-precision prediction of brain activity and reveal underlying causal dependencies and dynamic relationship characteristics. Code/project link: https://github.com/UAIBC-Brain/Spike-DB
论文ICLR 2026 Poster2026 年医学影像 统一脑表面与脑体积配准
ICLR 2026 Poster accepted paper at ICLR 2026. Accurate registration of brain MRI scans is fundamental for cross-subject analysis in neuroscientific studies. This involves aligning both the cortical surface of the brain and the interior volume. Traditional methods treat volumetric and surface-based registration separately, which often leads to inconsistencies that limit downstream analyses. We propose a deep learning framework, UCS, that registers 3D brain MRI images by jointly aligning both cortical and subcortical regions, through a unified volume-and-surface-based representation. Our approach leverages an intermediate spherical coordinate space to bridge anatomical surface topology with volumetric anatomy, enabling consistent and anatomically accurate alignment.
论文ICLR 2026 Poster2026 年医学影像 超越网格锁定体素:连续脑编码的神经响应函数
ICLR 2026 Poster accepted paper at ICLR 2026. Neural encoding models aim to predict fMRI-measured brain responses to natural images. fMRI data is acquired as a 3D volume of voxels, where each voxel has a defined spatial location in the brain. However, conventional encoding models often flatten this volume into a 1D vector and treat voxel responses as independent outputs. This removes spatial context, discards anatomical information, and ties each model to a subject-specific voxel grid. We introduce the NRF Neural Response Function, a framework that models fMRI activity as a continuous function over anatomical space rather than a flat vector of voxels. NRF represents brain activity as a continuous implicit function: given an image and a spatial coordinate (x, y, z) in standardized MNI space, the model predicts the response at that location.
论文ICLR 2026 Poster2026 年clinical prediction MRI 运动校正的可靠评测:数据集与洞见
ICLR 2026 Poster accepted paper at ICLR 2026. Correcting motion artifacts in scientific and medical imaging is important, as they significantly impact image quality. However, evaluating deep learning-based and classical motion correction methods remains fundamentally difficult due to the lack of accessible ground-truth target data. To address this challenge, we study three evaluation approaches: real-world evaluation based on reference scans, simulated motion, and reference-free evaluation, each with its merits and shortcomings. To enable evaluation with real-world motion artifacts, we release PMoC3D, a dataset consisting of unprocessed $\textbf{P}$aired $\textbf{Mo}$tion-$\textbf{C}$orrupted $\textbf{3D}$ brain MRI data.
论文ICLR 2026 Poster2026 年clinical prediction 基于小波图像变换与谱流匹配的功能 MRI 时间序列生成,用于脑疾病识别
ICLR 2026 Poster accepted paper at ICLR 2026. Functional Magnetic Resonance Imaging (fMRI) provides non-invasive access to dynamic brain activity by measuring blood oxygen level-dependent (BOLD) signals over time. However, the resource-intensive nature of fMRI acquisition limits the availability of high-fidelity samples required for data-driven brain analysis models. While modern generative models can synthesize fMRI data, they often remain challenging in replicating their inherent non-stationarity, intricate spatiotemporal dynamics, and physiological variations of raw BOLD signals. To address these challenges, we propose Dual-Spectral Flow Matching (DSFM), a novel fMRI generative framework that cascades dual frequency representation of BOLD signals with spectral flow matching. Code/project link: https://anonymous.4open.science/r/DSFM-123C; https://anonymous.4open.science/r/DSFM-
论文ICLR 2026 Poster2026 年trustworthy medical AI 单模态基础模型的联合适配用于多模态阿尔茨海默病诊断
ICLR 2026 Poster accepted paper at ICLR 2026. Alzheimer’s Disease (AD) is a progressive neurodegenerative disorder and a leading cause of dementia worldwide. Accurate diagnosis requires integrating diverse patient data modalities. With the rapid advancement of foundation models in neurobiology and medicine, integrating foundation models from various modalities has emerged as a promising yet underexplored direction for multi-modal AD diagnosis. A central challenge is enabling effective interaction among these models without disrupting the robust, modality-specific representations learned from large-scale pretraining. To address this, we propose a novel multi-modal framework for AD diagnosis that enables joint interaction among uni-modal foundation models through modality-anchored interaction.
论文ICLR 2026 Poster2026 年医学影像 你指点,我学习:交互式分割模型在线适配医学影像分布偏移
ICLR 2026 Poster accepted paper at ICLR 2026. Interactive segmentation uses real-time user inputs, such as mouse clicks, to iteratively refine model predictions. Although not originally designed to address distribution shifts, this paradigm naturally lends itself to such challenges. In medical imaging, where distribution shifts are common, interactive methods can use user inputs to guide models towards improved predictions. Moreover, once a model is deployed, user corrections can be used to adapt the network parameters to the new data distribution, mitigating distribution shift. Based on these insights, we aim to develop a practical, effective method for improving the adaptive capabilities of interactive segmentation models to new data distributions in medical imaging. Code/project link: https://github.com/WenTXuL/OAIMS
论文ICLR 2026 Poster2026 年clinical prediction CRONOS:4D 医学纵向序列的连续时间重建
ICLR 2026 Poster accepted paper at ICLR 2026. Forecasting how 3D medical scans evolve along time is important for disease progression, treatment planning, and developmental assessment. Yet existing models either rely on a single prior scan, fixed grid times, or target global labels, which limits voxel-level forecasting under irregular sampling. We present CRONOS, a unified framework for many-to-one prediction from multiple past scans that supports both discrete (grid-based) and continuous (real-valued) timestamps in one model, to the best of our knowledge the first to achieve continuous sequence-to-image forecasting for 3D medical data. CRONOS learns a spatio-temporal velocity field that transports context volumes toward a target volume at an arbitrary time, while operating directly in 3D voxel space.
论文ICLR 2026 Poster2026 年medical LLM agent K-Prism:知识引导与提示融合的通用医学图像分割模型
ICLR 2026 Poster accepted paper at ICLR 2026. Medical image segmentation is fundamental to clinical decision-making, yet existing models remain fragmented. They are usually trained on single knowledge sources and specific to individual tasks, modalities, or organs. This fragmentation contrasts sharply with clinical practice, where experts seamlessly integrate diverse knowledge: anatomical priors from training, exemplar-based reasoning from reference cases, and iterative refinement through real-time interaction. We present $\textbf{K-Prism}$, a unified segmentation framework that mirrors this clinical flexibility by systematically integrating three knowledge paradigms: (i) $\textit{semantic priors}$ learned from annotated datasets, (ii) $\textit{in-context knowledge}$ from few-shot reference examples, and (iii) $\textit{interactive feedback}$ from user inputs like clicks or scribbles. Code/project link: https://github.com/bangwayne/K-Prism
论文ICLR 2026 Poster2026 年clinical prediction 利用潜在流匹配学习患者特异疾病动力学用于纵向影像生成
ICLR 2026 Poster accepted paper at ICLR 2026. Understanding disease progression is a central clinical challenge with direct implications for early diagnosis and personalized treatment. While recent generative approaches have attempted to model progression, key mismatches remain: disease dynamics are inherently continuous and monotonic, yet latent representations are often scattered, lacking semantic structure, and diffusion-based models disrupt continuity through the random denoising process. In this work, we propose treating disease dynamics as a velocity field and leveraging Flow Matching (FM) to align the temporal evolution of patient data. Unlike prior methods, our approach captures the intrinsic dynamics of disease, making progression more interpretable.
论文ICLR 2026 Poster2026 年clinical prediction 能否用 LLM 为临床时间序列数据生成可迁移表征?
ICLR 2026 Poster accepted paper at ICLR 2026. Recent advances in vision-language models (VLMs) have achieved remarkable performance on standard medical benchmarks, yet their true clinical reasoning ability remains unclear. Existing datasets predominantly emphasize classification accuracy, creating an evaluation illusion in which models appear proficient while still failing at high-stakes diagnostic reasoning. We introduce Neural-MedBench, a compact yet reasoning-intensive benchmark specifically designed to probe the limits of multimodal clinical reasoning in neurology. Neural-MedBench integrates multi-sequence MRI scans, structured electronic health records, and clinical notes, and encompasses three core task families: differential diagnosis, lesion recognition, and rationale generation. Code/project link: https://neuromedbench.github.io/
论文ICLR 2026 Poster2026 年trustworthy medical AI AbdCTBench:从腹部表面几何学习临床生物标志物表征
ICLR 2026 Poster accepted paper at ICLR 2026. Body composition analysis through CT and MRI imaging provides critical insights for cardio-metabolic health assessment but remains limited by accessibility barriers including radiation exposure, high costs, and infrastructure requirements. We present AbdCTBench, a large-scale dataset containing 23,506 CT-derived abdominal surface meshes from 18,719 patients, paired with 87 comorbidity labels, 31 specific diagnosis codes, and 16 CT-derived biomarkers. Our key insight is that external surface geometry is predictive of internal tissue composition, enabling accessible health screening through consumer devices. We establish comprehensive benchmarks across seven computer vision architectures (ResNet-18/34/50, DenseNet-121, EfficientNet-B0, ViT-Small, Swin Transformer-Base), demonstrating that models can learn robust surface-to-biomarker representations directly from 2D mesh projections. Code/project link: https://abdctbenchrepo.github.io/AbdCTBench/
数据资源abdominal CT and MRI with multi-organ annotationsabdominal multi-organ segmentation benchmarkAMOS 2022 challenge benchmark; see official Grand Challenge page申请访问 AMOS 腹部多器官分割基准
AMOS is an abdominal multi-organ segmentation benchmark with CT and MRI cases for evaluating versatile medical image segmentation models. It supports abdominal organ segmentation, modality-general segmentation, and benchmarking of robust 3D segmentation methods.
数据资源cine cardiac MRI with segmentation labelscardiac MRI segmentation datasetACDC challenge dataset; see official database page申请访问 ACDC 自动心脏诊断挑战数据集
ACDC is a cardiac MRI dataset for automated cardiac diagnosis and segmentation. It supports left and right ventricular segmentation, myocardium segmentation, cardiac function quantification, and evaluation of robust cardiac image analysis methods.
数据资源MRI, DXA, ultrasound, retinal imaging, genetics, and health recordspopulation-scale multimodal imaging cohortPopulation-scale UK Biobank imaging cohort; application required申请访问 UK Biobank 影像数据
UK Biobank Imaging provides large-scale imaging phenotypes linked to genetic, lifestyle, and health outcome data. It is used for population-scale medical imaging AI, disease risk prediction, representation learning, multimodal biomedical modeling, and epidemiological AI studies.
数据资源brain MRI with demographic and clinical variablesbrain MRI and neuroimaging dataset collectionOASIS cross-sectional and longitudinal releases; see official site开放访问 OASIS 脑 MRI 与神经影像数据集
OASIS provides open-access neuroimaging datasets for studying normal aging, dementia, and brain structure. It is useful for brain MRI segmentation, age prediction, dementia classification, longitudinal modeling, and neuroimaging method benchmarking.
数据资源MRI, PET, biomarkers, clinical and cognitive assessmentslongitudinal neuroimaging and clinical datasetLongitudinal ADNI cohort data; access through ADNI/LONI申请访问 ADNI 阿尔茨海默病神经影像倡议数据集
ADNI provides longitudinal neuroimaging, biomarker, clinical, and cognitive data for Alzheimer disease research. It supports disease progression modeling, dementia diagnosis, multimodal prediction, biomarker discovery, and clinical translation studies.
数据资源raw MRI k-space and reconstructed MRI dataMRI reconstruction datasetLarge raw MRI reconstruction dataset; see official site申请访问 fastMRI 原始 MRI 重建数据集
fastMRI is a raw MRI dataset for accelerated magnetic resonance image reconstruction, originally released by NYU Langone Health and Meta AI. It is used for MRI reconstruction, compressed sensing replacement, generative reconstruction, and robustness evaluation.
数据资源multimodal brain MRI with tumor annotationsbrain tumor MRI segmentation challenge datasetBraTS 2024 challenge dataset; see Synapse project申请访问 BraTS 2024 脑肿瘤分割挑战数据集
BraTS 2024 provides multimodal brain MRI data and expert annotations for brain tumor segmentation and related tumor subregion analysis. It is a major benchmark for glioma segmentation, radiology AI, and robust multimodal MRI segmentation methods.
数据资源CT/MRI分割基准10 segmentation tasks开放访问 Medical Segmentation Decathlon 医学分割十项全能
Legacy multi-task biomedical image segmentation benchmark retained as a reference; newer segmentation benchmarks are listed above it.
数据资源Biomedical imagesTool/modelFoundation model and code开放访问 BiomedParse 生物医学图像解析基础模型
Foundation model and toolkit for all-in-one biomedical image parsing across recognition, detection, and segmentation tasks.
数据资源医学影像分割基准IMed-361M / IMIS-Bench开放访问 IMed-361M / IMIS-Bench 交互式医学图像分割基准
Interactive medical image segmentation benchmark and baseline from CVPR 2025, covering multiple modalities, organs, and target structures.
技术竞赛Spring 2026 launch announced; exact deadline TBAMRI 加速膝关节 MRI RSNA 2026 膝关节 MRI AI 挑战
RSNA announced a 2026 AI challenge focused on accelerating knee MRI examinations and reconstruction quality.